2) INFORMATION FOR SEQ ID NO: 1

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: ATCC 19606 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 

CAAACTCGTG AGCACATCCT TCTTTCTCGT CAGGTAGGTG TACCTTACAT 50 

CATCGTATTC TTAAACAAAT GCGACCTTGT TGATGACGAA GAATTACTTG 100 

AATTAGTAGA AATGGAAGTA CGTGAACTTC TTTCTACTTA TGACTTCCCA 150 

GGTGATGACA CTCCAGTAAT CCGTGGTTCA GCTCTTGCAG CGCTTAACGG 200 

TGAAGCTGGT CCTTACGGTG AAGAATCAGT TCTTGCTCTT GTAGCAGCAC 250 

TTGACTCTTA CATCCCAGAG CCAGAGCGTG CAATCGACAA AGCATTCTTG 300 

ATGCCAATCG AAGACGTATT CTCAATTTCT GGTCGTGGTA CAGTAGTAAC 350 

AGGCCGTGTT GAAGCTGGTA TCATCAAAGT TGGTGAAGAA GTAGAGATCG 400 

TTGGTATTAA AGATACAGTT AAAACAACTG TAACTGGCGT AGAAATGTTC 450 

CGTAAACTTC TTGACGAAGG CCGTGCAGGT GAGAACTGTG GTATCTTACT 500 

TCGTGGTACT AAGCGTGAAG AAGTACAACG TGGTCAAGTA CTTGCTAAAC 550 

CAGGTACAAT CAAGCCGCAC ACTAAATTCG ACGCAGAAGT ATACGTACTT 600 

TCTAAAGAAG AAGGTGGTCG TCACACTCCA TTCTTAAATG GTTACCGTCC 650 

ACAGTTCTAC TTCCGTACAA CTGACGTAAC TGGTGCRATC CAGTTGAAAG 700 

AAGGCGTTGA AATGGTAATG CCAGGTGACA ACGTTGAAAT GTCAGTAGAA 750 



2) INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 



CGGTGCGATC CTCGTGGTCG CCGCGACCGA CGGCCCCATG GCCCAGACCC 50

GCGAGCACGT CCTGCTCGCC CGTCAGGTCG GCGTTCCCAC CATCCTCATC 100

GCCCTCAACA AGTCCGACAT GGTTGACGAC GAGGAAATGA TGGAACTGGT 150

CGAGGAGGAG TGCCGCGACC TGCTGGAGTC CCAGGACTTC GATCGCGATG 200

CCCCGATCGT CCAGGTTTCC GCTCTGAAGG CCCTCGAGGG CGACGCGGAG 250

TGGGTTGCCA AGATCGAGGA GCTCATGGAG GCTGTGGATT CCTACATCCC 300

CACCCCCGAG CGCGATATGG ACAAGCCCTT CCTCATGCCG ATCGAGGACG 350

TCTTCACGAT CACAGGTCGT GGCACGGTCG TCACGGGGCG TGTTGAGCGT 400

GGCAAGCTGC CGATCAACTC CGAGGTCGAG ATCCTCGGTA TCCGTGATCC 450

CCAGAAGACC ACGGTCACCG GCATCGAGAT GTTCCACAAG TCGATGGACG 500

AGGCATGGGC CGGCGAGAAC TGTGGCCTGC TGCTGCGCGG TACCAAGCGC 550

GATGAGGTTG AGCGCGGCCA GGTTGTGGCC ATTCCCGGCT CCATCACGCC 600

TCACACCGAG TTCGAGGGCC AGGTTTACAT CCTCAAGAAG GAAGAGGGCG 650

GCCGTCACAA CCCGTTCTTC TCGAACTACC GTCCGCAGTT CTACTTCCGT 700

ACCACGGACG TGACCGGCGT CATCACCCTC CCCGAGGGCA CCGACATGGT 750

CATGCCTGGC GACACCACCG AGATCTCCGT TCAGCTGATC CAGCCCATCG 800

CCATGGAGCC CGGCTGGGCT TCGCCA 826

2) INFORMATION FOR SEQ ID NO: 3

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 835 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aerococcus viridans 

(B) STRAIN: ATCC 11563 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 



TGGTGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAGCACAT CCTTTTAGCT GGCCAAATCG GTGTTCCTGC ATTCGTAGTA 100 

TTCTTAAACA AAGTTGACCA AGTTGACGAT GAAGAATTAC TAGAATTAGT 150

TGAAATGGAA GTTCGTGACT TATTATCTGA GTACAACTAC CCAGGTGACG 200 

ATCTACCTGT AATCGCTGGT TCTGCTTTAT TAGCATTACA AGGCGATGAA 250 

GCTCAAGAAG CTAAAATCAT GGAATTAATG GAAGCTGTAG ACTCTTACAT 300 

TCCAGAACCA GAACGTGACA ACGACAAACC ATTCATGATG CCAATTGAGG 350 

ATGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA 400

CGTGGTGAAG TTCGTACAGG TGACGAAGTT GACATCGTTG GTATTGCTGA 450 

ACAAATCGGT AAATCAGTTG TAACTGGTGT TGAAATGTTC CGTAAAAACT 500 

TAGACTACGC TCAAGCTGGT GACAACATCG GTGCATTATT ACGTGGTGTT 550 

CAACGTGAAG ACATCCAACG TGGTCAAGTA TTGGCTGCTC CTGGTTCAAT 600 

CACTCCACAT ACTAAATTTA AAGCGCAAGT TTACGTTTTA TCTAAAGAAG 650

AAGGTGGACG TCATACACCA TTCTTAACTA ACTACCGTCC ACAATTCTAC 700 

TuT^cTlALT/T CIUAc'aTi AC Tt^l^rilu^ ACI^I^'lTuJ IuCuACUU^'lT 7t>U 

TATGGTTATG CCTGGTGACA ACGTTGATAT GGACGTTGAA TTGATTCACC 800 

CAGTTGCGAT CGAAGATGGT ACTAAATTCT CTATC 835 



2) INFORMATION FOR SEQ ID NO: 4 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Achromobacter xylosoxidans subsp. denitrificans

(B) STRAIN: ATCC 15173

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

CCTGGTGGTG TCGGCCGCTG ACGGCCCGAT GCCGCAAACG CGCGAACACA 50

TCCTGCTGAG CCGCCAGGTT GGCGTGCCGT ACATCATCGT CTTCCTGAAC 100 

AAGGCCGACA TGGTTGACGA CGCCGAGCTG CTTGAGCTGG TGGAAATGGA 150 

AGTTCGCGAR CTGCTGAGCA AGTACGACTT CCCGGGCGAC GACACCCCGA 200 

TCGTGAAGGG TTCGGCCAAG CTGGCGCTGG AAGGCGACAA GGGCGAACTG 250 

GGCGAACAGG CCATCATGGC GCTGGCCGCT GCGCTGGACT CGTACATCCC 300

GACGCCTGAG CGTGCCGTTG ACGGCGCGTT CCTGATGCCG GTTGAAGACG 350 

TGTTCTCGAT CTCGGGTCGC GGCACCGTGG TGACCGGCCG TATCGAACGC 400

GGCATCATCA AGGTCGGCGA GGAAATCGAA ATCGTCGGTC TGGTGCCGAC 450 

GGTGAAGACG ACCTGCACGG GCGTGGAAAT GTTCCGCAAG CTGCTGGACC 500 

AAGGTCAAGC CGGCGACAAC GTGGGCATCC TRCTGCGCGG CACCAAGCGT 550

GAAGACGTCC AGCGCGGCCA GGTTCTGGCC AAGCCGGGCT CGATCACCCC 600 

GCACACGGAC TTCACGTCCG AGGTGTACAT CCTGTCCAAG GAAGAAGGCG 650 

GCCGTCACAC TCCGTTCTTC CAAGGCTATC GTCCCCAGTT CTACTTCCGC 700 

ACGACGGACG TGACGGGCAC GATCGAGCTG CCGGCCGACA AGGAAATGGT 750 

CCTGCCGGGC GACAACGTGG CCATGACGGT CAAGCTGCTG GCTCCGATCG 800

CCATGGAAGA AGGCCTGCGT TCGCCAC 827 



2) INFORMATION FOR SEQ ID NO: 5

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 823 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Anaerorhabdus furcosus 

(B) STRAIN: ATCC 25662 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5



TGGATCAATC CTAGTAGTTG CTGCAACTGA TGGACCAATG CCTCAAACTC 50 

GTGAACATAT CTTACTTGCT CGTCAAGTAG GTGTTCCAAG AATGGTTGTA 100 

TTCTTGAACA AATGCGACAT GGTTGAAGAT GAAGAATTAA TCGACCTTGT 150 

TGAAATGGAA GTTCGTGAAC TTCTAAGTGC TTACGGTTTC GAAGGTGATG 200 

ATACACCAGT TATCCGTGGT TCTGCATTAA AATCTCTTGA AGGAAATGCT 250

GATTGGGAAG CAAAAGTTGC TGAATTAATG GATGCAGTTG ACTCTTGGAT 300 

TCCAACTCCA ACTCATGAAA CAGACAAACC ATTCTTAATG GCTGTTGAAG 350 

ATGTATTCAC AATTACAGGT CGTGGTACAG TTGCTACTGG ACGTGTTGAA 400 

CGTGGACACT TAAACCTTAA CGAAGAAGTT GAAATCGTTG GTATTCATGA 450 

TACTAAGAAA TCAGTTGTTA CTGGTATCGA AATGTTCCGT AAATTATTAG 500

ACTATGCTGA AGCAGGAGAC AACATTGGTG CATTATTACG TGGTGTTTCT 550 

CGTGATGAAA TCGAACGTGG ACAATGTCTA GCTAAACCTG GATCAGTTAC 600 

TCCACATACA GCTTTCAAAG CTCAAGTATA CGTATTAACT AAAGAAGAAG 650 

GTGGACGTCA TACACCATTC GTAACTAACT ACCGTCCTCA ATTCTATTTC 700 

CGTACAACTG ACGTAACAGG AGTTGTTAAA CTTCCTGAAG GTACTGAAAT 750

GGTTATGCCT GGAGACAACA TCGAAATGAT CGTTGAATTA ATCGCTCCAA 800 

TCGCTGTTGA ACAAGGAACT AAG 823 






SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 6 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: 4229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6



CGGCGGTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCTCAAACTC 50

GTGAGCACAT CCTTCTTTCT CGTCAAGTAG GTGTACCTTA CATCGTTGTA 100

TTCTTAAACA AATGCGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150

AGAAATGGAA GTTCGCGACC TATTATCTGA ATACGGATTC CCAGGCGACG 200

ACATTCCTGT AATCAAAGGT TCTGCTCTTA AAGCTCTTCA AGGAGAAGCT 250

GATTGGGAAG CAAAAATCAT TGAATTAATG GCTGAAGTTG ATGCTTACAT 300

CCCAACTCCA GAACGTGAAA CTGACAAACC ATTCTTAATG CCTGTAGAGG 350

ACGTATTCTC TATCACAGGT CGTGGTACAG TTGCTACTGG TCGTGTTGAG 400

CGCGGTATCG TTAAAGTTGG TGACGTAGTA GAAATCATCG GTCTTGCTGA 450

AGAAAATGCT TCTACAACTG TAACTGGTGT AGAGATGTTC CGTAAACTTC 500

TTGACCAAGC TCAAGCTGGA GACAACATCG GTGCTTTACT TCGTGGGGTT 550

GCTCGTGAAG ACATCCAACG TGGACAAGTA CTTGCAAAAA GCGGTTCTGT 600

AAAAGCTCAC GCTAAATTCA AAGCTGAAGT TTTCGTATTA TCTAAAGAAG 650

AAGGTGGACG TCACACTCCA TTCTTCGCTA ACTACCGTCC TCAGTTCTAC 700

TTCCGTACAA CTGACGTAAC TGGTATCATC CAATTACCAG AAGGTACTGA 750

AATGGTAATG CCTGGTGACA ACATCGAAAT GACTATCGAA CTTATCGCTC 800

CAATCGCTAT CGAAGAGGGA ACTAA 825






2) INFORMATION FOR SEQ ID NO: 7

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 



CGGCGGTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCTCAAACAC 50

GTGAGCACAT CCTTCTTTCT CGTCAAGTAG GTGTTCCTTA CATCGTTGTA 100

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAATTAT TAGAATTAGT 150

AGAAATGGAA GTTCGCGACC TATTATCTGA ATACGGATTC CCAGGCGACG 200

ACATTCCTGT AATCAAAGGT TCTGCTCTTA AAGCTCTTCA AGGAGAAGCT 250

GATTGGGAAG CAAAAATCAT TGAATTAATG GCTGAAGTTG ATGCTTACAT 300

CCCAACTCCA GAACGTGAAA CTGACAAACC ATTCTTAATG CCTGTAGAGG 350

ACGTATTCTC TATCACAGGT CGTGGTACAG TTGCTACTGG TCGTGTTGAG 400

CGCGGTATCG TTAAAGTTGG TGACGTAGTA GAAATCATCG GTCTTGCTGA 450

AGAAAATGCT TCTACAACTG TAACTGGTGT AGAGATGTTC CGTAAACTTC 500

TTGACCAAGC TCAAGCTGGA GACAACATCG GTGCTTTACT TCGTGGGGTT 550

GCTCGTGAAG ACATCCAACG TGGACAAGTA CTTGCAAAAA GCGGTTCTGT 600

AAAAGCTCAC GCTAAATTCA AAGCTGAAGT TTTCGTATTA TCTAAAGAAG 650

AAGGTGGACG TCACACTCCA TTCTTCGCTA ACTACCGTCC TCAGTTCTAC 700

TTCCGTACAA CTGACGTAAC TGGTATCATC CAATTACCAG AAGGTACTGA 750

AATGGTAATG CCTGGTGACA ACATTGAAAT GACTATCGAA CTTATCGCTC 800

CAATCGCTAT CGAAGAGGGA ACTAAATTC 829



2) INFORMATION FOR SEQ ID NO: 8

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 818 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonis

(B) STRAIN: ATCC 8503

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

CGGTGCTATC ATCGTAGTTG CTGCTACTGA TGGTCCTATG CCTCAAACTC 50 

GCGAGCACAT CCTTTTGGCT CGTCAGGTAA ACGTTCCGAG ATTGGTTGTA 100

TTCATGAACA AGTGTGACAT GGTTGACGAC GAGGAAATGT TGGAATTGGT 150 

TGAGATGGAG ATGAGAGAGT TGCTTTCATT CTATCAATTC GACGGTGACA 200 

ACACTCCGAT CATCCGTGGT TCTGCTCTTG GTGCATTGAA CGGTGATGCT 250 

CAATGGGAAG ATAAAGTAAT GGAGTTGATG GAAGCTTGTG ATACTTGGAT 300 

TCCTCTGCCT CCGCGCGAAA TCGACAAGCC GTTCTTGATG CCGGTTGAGG 350

ACGTATTCTC AATCACGGGT CGTGGTACTG TTGCTACAGG TCGTATCGAG 400 

ACAGGTATTG TTAAGGTTGG TGAGGAAGTT CAGATCATCG GTCTTGGCGC 450 

TGCTGGTAAG AAATCTGTTG TTACAGGTGT TGAGATGTTC CGTAAGTTAT 500 

TGGATCAAGG TGAGGCTGGT GATAACGTTG GTTTGTTGCT TCGCGGTATC 550 

GATAAGAATG AGATCAAGCG TGGTATGGTA ATCTGCCACC CGGGTCAGGT 600

TAAAGAGCAT TCTAAGTTCA AGGCTGAGGT TTATATCTTG AAGAAAGAGG 650 

AAGGTGGTCG TCACACTCCG TTCCACAACA AATATCGTCC TCAGTTCTAT 700 

ATCCGTACAT TGGATGTAAC TGGTGAGATC ACTTTGCCGG AAGGAACTGA 750 

AATGGTAATG CCGGGTGATA ACGTAACGAT CGAGGTTGAG TTGATCTATC 800 

CGGTAGCATG TAGCGTAG 818



2) INFORMATION FOR SEQ ID NO: 9 


(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 639 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus casseliflavus

(B) STRAIN: R763



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 



GGTCCTATGC CTCAAACACG TGAACACATC TTGTTATCAC GTAACGTTGG 50

TGTACCATAC ATCGTTGTTT TCTTAAACAA AATGGATATG GTTGATGACG 100

AAGAATTACT AGAATTAGTT GAAATGGAAG TTCGTGACTT ATTGTCAGAA 150

TATGACTTCC CAGGCGACGA TGTTCCTGTA ATCGCTGGTT CTGCTTTGAA 200

AGCTCTTGAA GGCGATGCTT CATACGAAGA AAAAATCATG GAATTAATGG 250

CTGCAGTTGA CGAATACGTT CCAACTCCAG AACGTGACAC TGACAAACCA 300

TTCATGATGC CAGTCGAAGA CGTATTCTCA ATCACTGGAC GTGGTACTGT 350

TGCTACAGGC CGTGTTGAAC GTGGACAAGT TCGCGTTGGT GACGAAGTTG 400

AAATCGTTGG TATTGCTGAA GAAACTGCTA AAACAACTGT AACTGGTGTT 450

GAAATGTTCC GTAAATTGTT AGACTATGCT GAAGCAGGGG ATAACATTGG 500

TGCATTGCTA CGTGGTGTTG CTCGTGAAGA CATCCAACGT GGACAAGTAT 550

TGGCTAAAGC TGGTACAATC ACACCTCATA CAAAATTTAA AGCTGAAGTT 600

TACGTTTTAA CAAAAGAAGA AGGTGGACGT CACACACCA 639



2) INFORMATION FOR SEQ ID NO: 10 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 692 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSG 197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10

GAACACATTC TTTTATCACG TAACGTTGGT GTTCCAGCAT TAGTTGTATT 50 

CTTAAACAAA GTTGACATGG TTGACGATGA AGAATTATTA GAATTAGTAG 100 

AAATGGAAGT TCGTGACTTA TTAAGCGAAT ATGACTTCCC AGGTGACGAT 150 

GTACCTGTAA TCTCTGGTTC TGCATTAAAA GCTTTAGAAG GCGACGCTGA 200

CTATGAGCAA AAAATCTTAG ACTTAATGCA AGCTGTTGAT GACTTCATTC 250 

CAACACCAGA ACGTGATTCT GACAAACCAT TCATGATGCC AGTTGAGGAC 300 

GTATTCTCAA TCACTGGTCG TGGTACTGTT GCTACAGGCC GTGTTGAACG 350 

TGGTCAAATC AAAGTCGGTG AAGAAATCGA AATCATCGGT ATGCAAGAAG 400

AATCAAGCAA AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA 450

GACTACGCTG AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTTC 500 

ACGTGATGAC GTACAACGTG GTCAAGTTTT AGCTGCTCCT GGTACTATTA 550 

CACCACATAC AAAATTCAAA GCGGATGTTT ACGTTTTATC TAAAGATGAA 600 

GGTGGTCGTC ATACACCATT CTTCACTAAC TACCGCCCAC AATTCTATTT 650 

CCGTACTACT GACGTAACTG GTGTTGTTAA CTTACCAGAA GG 692



2) INFORMATION FOR SEQ ID NO: 11 


(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 821 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides ovatus 

(B) STRAIN: ATCC 8483 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

CGGTGCTATC ATCGTTTGTG CTGCAACTGA TGGTCCGATG CCTCAAACTC 50 

GCGAACACAT TCTGTTAGCT CGTCAGGTAA ACGTACCTCG TCTGGTTGTA 100 

TTCTTGAACA AATGCGATAT GGTAGACGAC GAAGAAATGT TGGAACTCGT 150 

TGAAATGGAA ATGAGAGAAC TCCTTTCATT CTATGATTTC GATGGTGACA 200 

ATACTCCTAT CATCCGTGGT TCTGCTCTTG GCGCATTGAA CGGTGTTGAA 250 

AAATGGGAAG ACAAAGTTAT GGAACTGATG GATGCAGTTG ATAACTGGAT 300 

TCCACTGCCT CCGCGCGATG TTGATAAACC ATTCTTGATG CCGGTTGAAG 350 

ACGTGTTCTC TATCACAGGT CGTGGTACTG TAGCAACAGG TCGTATCGAA 400 

ACAGGTGTCA TCCACGTTGG TGATGAAGTC GAAATTCTTG GTTTAGGTGA 450 

AGATAAGAAA TCAGTTGTAA CTGGTGTTGA AATGTTCCGT AAACTGTTGG 500 

ATCAAGGTGA AGCTGGTGAC AACGTAGGTC TTTTGCTTCG TGGTATTGAC 550 

AAGAACGAAA TCAAACGTGG TATGGTTCTT TGTAAACCAG GTCAGATTAA 600 

ACCGCACTCT AAATTCAAAG CTGAGGTTTA TATCTTGAAG AAAGAAGAAG 650 

GTGGTCGTCA CACTCCGTTC CACAACAAAT ACCGTCCTCA GTTCTACTTG 700 

CGTACTATGG ACTGTACAGG TGAAATCACT TTGCCGGAAG GAACAGAAAT 750 

GGTAATGCCG GGTGATAACG TAACTATTAC AGTTGAGTTG ATTTACCCAG 800 

TAGCATTGAA CCCGGGCTTC G 821 



2) INFORMATION FOR SEQ ID NO: 12 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 838 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bartonella henselae 

(B) STRAIN: ATCC 49882

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 



TGGTGCGATT TTGGTTGTTT CAGCTGCTGA TGGTCCGATG CCTCAAACAC 50

GTGAGCATAT TCTTCTTGCC CGTCAGGTTG GTGTTCCAGC GATTGTTGTT 100

TTTCTTAATA AGGTTGATCA GGTTGATGAT GCTGAGCTTT TGGAGCTTGT 150

TGAGCTTGAA GTTCGGGAGT TATTGTCGAA ATATGATTTT CCAGGAGACG 200

ATATTCCGAT CGTTAAAGGT TCTGCTTTGG CAGCGCTTGA AGATAAAGAT 250

AAAAGCATTG GTGAAGATGC GGTTCGTCTT TTGATGAGTG AAGTTGATAA 300

TTATATACCG ACGCCTGAAC GTCCTGTTGA TCAGCCGTTT TTGATGCCAA 350

TTGAAGATGT TTTTTCGATT TCGGGTCGTG GAACTGTTGT GACGGGTCGT 400

GTTGAGCGTG GTGTTATTAA GGTTGGTGAA GAAGTTGAGA TTATCGGCAT 450

TCGTCCAACT TCTAAGACAA CAGTTACAGG GGTTGAAATG TTCCGCAAGC 500

TTTTAGATCA GGGGCAAGCG GGTGATAATA TTGGAGCGCT GCTTCGTGGT 550

ATTGATCGTG AAGGGATTGA GCGTGGACAA GTTTTGGCGA AGCCTGCTTC 600

GGTTACACCT CATACGAGAT TTAAAGCAGA GGCTTACATT TTGACGAAAG 650

ATGAAGGTGG TCGTCATACT CCATTTTTCA CGAATTATCG TCCTCAGTTT 700

TATTTCCGTA CTACGGATGT AACGGGAATT GTTACGCTTC CAGAAGGTAC 750

AGAGATGGTT ATGCCTGGTG ATAATGTTGC TATGGATGTC TCTCTGATTG 800

TTCCAATTGC CATGGAAGAA AAACTTCGTT TTGCTATC 838



2) INFORMATION FOR SEQ ID NO: 13 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium adolescentis 

(B) STRAIN: ATCC 15703 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

TGGCGCCATC CTTGTTGTGG CCGCCACCGA CGGCCCGATG GCTCAGACCC 50 

GCGAGCACGT GCTGCTCGCT CGTCAGGTGG GCGTCCCGAA GATCCTCGTC 100 

GCTCTGAACA AGTGCGATAT GGTCGACGAC GACGAGCTCA TCGAGCTCGT 150 

TGAGGAAGAG GTCCGTGACC TCCTCGACGA AAATGGCTTC GATCGCGATT 200 

GCCCGGTCAT CCACGTGTCC GCTTACGGCG CACTGCACGA TGACGCTCCG 250 

GACCACGAGA AGTGGGTTGA GCAGATCAAG AAGCTCATGG ACGCCGTCGA 300

TGACTACATC CCGACCCCGG TCCACGATCT GGACAAGCCG TTCCTGATGC 350

CGATCGAAGA TGTCTTCACC ATCTCCGGCC GTGGCACCGT GGTGACCGGC 400

CGTGTCGAGC GTGGTAAGCT CCCGGTCAAC TCCAACGTCG AGATCGTCGG 450 

CATCCGTCCG ACCCAGACCA CCACCGTCAC CTCCATCGAG ACCTTCCACA 500 

AGCAGATGGA CGAGTGCGAG GCTGGCGACA ACACCGGTCT GCTGCTCCGC 550 

GGCATCAACC GTGACCAGGT CGAGCGTGGC CAGGTTCTGG CTGCTCCGGG 600 

CTCCGTGACC CCGCACACCA AGTTCGAGGG CGAAGTCTAC GTGCTGACCA 650 

AGGACGAAGG CGGCCGTCAC TCGCCGTTCT TCTCCAACTA CCGTCCGCAG 700 

TTCTACTTCC GTACCACCGA CGTCACCGGC GTCATCACCC TGCCGGAAGG 750 

CGTTGAGATG GTGCAGCCGG GCGATCACGC TACCTTCGGC GTTGAGCTGA 800 

TCCAGCCGAT CGCTATGGAA GAGGGCCTGA CCTTCGCAG 839 



2) INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium dentium 

(B) STRAIN: ATCC 27534 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

TGGCGCTATC CTCGTTGTGG CCGCCACCGA CGGCCCGATG GCTCAGACCC 50

GCGAGCACGT GCTGCTCGCT CGTCAGGTGG GCGTGCCGCG TATCCTCGTC 100

GCCCTGAACA AGTGCGATAT GGTCGACGAC GAAGAGCTCA TCGAGCTCGT 150

TGAGGAAGAG GTCCGTGACC TCCTCGACGA AAACGGCTTC GATCGCGATT 200

GCCCGGTCAT CCACACCTCC GCCTACGGCG CGCTGCACGA TGACGCTCCG 250

GACCACGACA AGTGGGTTGA GTCCGTCAAG GAACTCATGA AGGCCGTCGA 300

CGAGTACATC CCGACCCCGA CCCACGATCT GGACAAGCCG TTCCTGATGC 350

CGATCGAAGA TGTGTTCACC ATCTCCGGCC GTGGCACCGT GGTTACCGGC 400

CGTGTCGAGC GTGGTAAGCT CCCGGTCAAC TCCAACGTTG AGATCGTCGG 450

CATCCGTCCG ACCCAGACCA CCACCGTCAC CTCCATCGAG ACCTTCCACA 500

AGCAGATGGA CGAGTGCGAG GCTGGCGACA ACACCGGTCT GCTGCTCCGC 550

GGCATCAACC GTGACCAGGT CGAGCGTGGC CAGGTTCTGG CTGCTCCGGG 600

CTCCGTGACC CCGCACACCA AGTTCGAGGG CGAAGTCTAC GTGCTGACCA 650

AGGACGAAGG CGGCCGTCAC TCGCCGTTCT TCTCCAACTA CCGTCCGCAG 700

TTCTACTTCC GTACCACCGA CGTCACCGGC GTCATCACCC TGCCGGAAGG 750

CGTTGAGATG GTGCAGCCGG GCGATCACGC TACCTTCGGC GTTGAGCTGA 800

TCCAGCCGAT CGCTATGGAA GAGGGCCTGA CCTTCGCAG 839



2) INFORMATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 838 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brucella abortus 

(B) STRAIN: S2308

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

TGGCGCGATC CTGGTGGTTT CGGCTGCTGA CGGCCCGATG CCGCAGACCC 50 

GCGAGCACAT CCTGCTTGCC CGTCAGGTTG GCGTTCCGGC GATCGTCGTG 100

TTCCTCAACA AGTGCGACCA GGTTGACGAT GCAGAACTGC TCGAACTGGT 150 

TGAACTGGAA GTGCGCGAAC TTCTGTCGAA GTACGAATTC CCCGGCGACG 200 

AAATCCCGAT CATCAAGGGC TCGGCTCTTG CTGCTCTGGA AGATTCTTCC 250 

AAGGAACTGG GCGAAGATGC CATCCGCAAC CTGATGGACG CGGTTGACAG 300 

CTACATTCCG ACCCCGGAAC GCCCGATCGA CCAGCCGTTC CTGATGCCGA 350

TCGAAGACGT GTTCTCGATC TCCGGCCGTG GTACGGTTGT GACGGGTCGC 400 

UTUiAUUjL^ \iAAliTTliAAA TL^l^CiUCAT 4b U 

CAAGGCGACG ACGAAGACCA CGGTTACCGG CGTTGAAATG TTCCGCAAGC 500 

TGCTCGACCA GGGCCAGGCT GGCGACAACA TTGGCGCGCT GATCCGCGGC 550 

GTTGGCCGTG AAGACGTTGA ACGCGGCCAG GTTCTCTGCA AGCCGGGTTC 600

TGTGAAGCCG CACACCAAGT TTAAGGCAGA AGCCTATATT CTGACCAAGG 650 

ACGAAGGTGG CCGTCATACG CCGTTCTTCA CCAACTACCG TCCGCAGTTC 700 

TACTTCCGTA CGACGGACGT GACGGGTGTT GTGACGCTTC CGGCTGGCAC 750 

GGAAATGGTC ATGCCTGGCG ATAACGTCGC CATGGACGTT ACCCTGATCG 800 

TGCCGATCGC CATGGAAGAG AAGCTTCGCT TCGCTATC 838



2) INFORMATION FOR SEQ ID NO: 16 


(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 771 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia cepacia 

(B) STRAIN: LSPQ 2217 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

GGCAGCAGAC GGCCCGATGC CGCAAACGCG TGAGCACATC CTGCTGGCGC 50 

GTCAGGTTGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA GTGCGACATG 100 

GTGGACGACG CCGAACTGCT CGAGCTGGTC GAGATGGAAG TTCGCGAACT 150 

CCTGTCGAAG TACGACTTCC CGGGCGACGA CACGCCGATC GTGAAGGGTT 200 

CGGCGAAGCT GGCGCTGGAA GGCGACACGG GCGAGCTGGG CGAAGTGGCG 250 

ATCATGAGCC TGGCCGACGC GCTGGACACG TACATCCCGA CGCCGGAGCG 300 

TGCAGTTGAC GGCGCGTTCC TGATGCCGGT GGAAGACGTG TTCTCGATCT 350 

CGGGCCGCGG TACGGTGGTG ACGGGTCGTG TCGAGCGCGG CATCGTGAAG 400

GTCGGCGAAG AAATCGAAAT CGTCGGTATC AAGCCGACGG TGAAGACGAC 450 

CTGCACGGGC GTTGAAATGT TCCGCAAGCT GCTGGACCAA GGTCAAGCAG 500 

GCGACAACGT TGGTATCCTG CTGCGCGGCA CGAAGCGTGA AGACGTGGAG 550 

CGTGGCCAGG TTCTGGCGAA GCCGGGTTCG ATCACGCCGC ACACGCACTT 600 

CACGGCTGAA GTGTACGTGC TGAGCAAGGA CGAAGGCGGC CGTCACACGC 650 

CGTTCTTCAA CAACTACCGT CCGCAGTTCT ACTTCCGTAC GACGGACGTG 700 

ACGGGCTCGA TCGAGCTGCC GAAGGACAAG GAAATGGTGA TGCCGGGCGA 750 

CAACGTGTCG ATCACGGTGA A 771 



2) INFORMATION FOR SEQ ID NO: 17 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea davisae 

(B) STRAIN: ATCC 33431 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 



GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCAATGC CACAGACCCG 50

TGAGCACATC CTGCTGGGTC GTCAGGTTGG CGTTCCGTAC ATCATCGTGT 100

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150

GAAATGGAAG TTCGTGAACT TCTGTCCCAG TACGACTTCC CGGGCGACGA 200

TACTCCAATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGAAGCAG 250

AGTGGGAAGC TAAAATCGTT GAGCTGGCTG GCTACCTGGA TTCTTACATC 300

CCTGAGCCAG AGCGTGCTAT CGATAAGCCG TTCCTGCTGC CAATCGAAGA 350

CGTATTCTCC ATCTCCGGCC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400

GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAT 450

ACTGCGAAAT CTACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCAGG CTCTATCAAG 600

CCACACACCA AGTTCGAATC TGAAGTGTAC ATCCTGTCCA AAGACGAAGG 650

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700

GTACAACTGA CGTGACCGGC ACCATCGAAC TGCCAGAAGG CGTTGAGATG 750

GTAATGCCTG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCAAT 800

CGCGATGGAT GACGGTCTGC GTTTCGCAA 829



2) INFORMATION FOR SEQ ID NO: 18

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Cedecea neteri 

(B) STRAIN: ATCC 33855 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 



CGCTATCCTG GTTGTTGCTG CGACTGACGG CCCTATGCCT CAGACCCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTTGGCG TTCCTTACAT CATCGTGTTC 100 

CTGAACAAAT GTGACATGGT TGATGACGAA GAGCTGCTGG AGCTGGTTGA 150 

AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGCGATGACA 200 

CTCCAATCAT CCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGAAGCAGAG 250

TGGGAAGCTA AAATYGTTGA GCTGGCTGGC TTCCTGGATT CCTACATCCC 300

AGAACCAGTA CGTGCAATCG AYCTGCCGTT CCTGCTGCCA ATCGAAGACG 350

TATTCTCCAT CTCCGGCCGT GGTACCGTTG TTACCGGTCG TGTAGAGCGC 400

GGTATCGTTA AAGTGGGCGA AGAAGTAGAA ATCGTTGGTA TCAAAGATAC 450 

TGCGAAATCT ACCTGTACCG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500

AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAACGT 550 

GAAGAAATCG AACGTGGTCA GGTTCTGGCT AAGCCAGGCT CTATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT CCTGTCCAAA GACGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCACAGTT CTACTTCCGT 700 

ACAACTGACG TGACCGGTAC CATCGAACTG CCAGAAGGCG TAGAGATGGT 750

AATGCCAGGC GACAACATCA AAATGGTTGT TACCCTGATC CACCCAATCG 800 

CGATGGACGA CGGTCTGCGT TTCG 824 



2) INFORMATION FOR SEQ ID NO: 19 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea lapagei 

(B) STRAIN: ATCC 33432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

CGCTATTCTG GTTGTTGCTG CAACTGACGG CCCTATGCCT CAGACCCGTG 50 

AGCACATCCT GCTGGGTCGC CAGGTTGGCG TTCCTTACAT CATCGTGTTC 100 

CTGAACAAAT GTGACATGGT TGATGACGAA GAGCTGCTGG AGCTGGTAGA 150 

AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCA GGCGATGATA 200 

CCCCAATCAT CCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGAAGCAGAG 250

TGGGAAGCTA AAATCGTTGA GCTGGCTGGC TTCCTGGATT CCTACATCCC 300

AGAACCAGTA CGTGCAATCG ACCTGCCGTT CCTGCTGCCA ATCGAAGACG 350

TATTCTCCAT CTCCGGCCGT GGTACCGTTG TKACCGGTCG TGTAGAGCGC 400

GGTATCGTTA AAGTGGGCGA AGAAGTAGAA ATCGTTGGTA TCAAAGATAC 450

TGCGAAATCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500

AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAACGT 550

GAAGAAATCG AACGTGGTCA GGTTCTGGCT AAGCCAGGCT CTATCAAGCC 600

GCACACCAAG TTCGAATCTG AAGTGTACAT CCTGTCCAAA GACGAAGGCG 650

GCCGTCATAC TCCGTTCTTC AARGGCTACC GTCCACAGTT CTACTTCCGT 700

ACCACTGACG TGACCGGTAC CATCGAACTG CCAGAAGGCG TAGAGATGGT 750

AATGCCAGGT GACAACATCA AAATGGTTGT TACCCTGATC CACCCAATCG 800

CGATGGACGA CGGTCTGCGT TTCGCAA 827



2) INFORMATION FOR SEQ ID NO: 20 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 831 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pneumoniae 

(B) STRAIN: CWL 029 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 



GCGGAGCTAT CCTAGTCGTT TCAGCTACAG ACGGAGCTAT GCCACAAACT 50 

AAAGAACATA TCTTGCTAGC TCGCCAGGTT GGAGTTCCTT ATATCGTTGT 100 

TTTCTTGAAT AAAGTAGATA TGATCTCTCA AGAAGATGCT GAACTTATTG 150

ACCTTGTTGA GATGGAACTT AGTGAGCTTC TTGAAGAAAA AGGCTACAAA 200 

GGATGCCCTA TTATCCGTGG TTCTGCTTTG AAAGCTCTTG AAGGTGATGC 250 

AAATTATATC GAAAAAGTTC GAGAACTTAT GCAAGCTGTG GATGACAACA 300 

TCCCTACACC AGAAAGAGAA ATTGATAAGC CTTTCTTAAT GCCTATCGAA 350 

GACGTATTCT CAATCTCTGG TCGTGGTACT GTGGTTACAG GAAGAATCGA 400

GCGTGGAATC GTTAAAGTTT CTGATAAAGT TCAGCTCGTG GGATTAGGAG 450 

AGACTAAAGA AACAATCGTT ACTGGAGTCG AAATGTTCAG GAAAGAACTT 500 

CCTGAAGGTC GTGCAGGAGA AAACGTTGGT TTACTCCTCA GAGGTATTGG 550 

AAAGAACGAT GTTGAAAGAG GTATGGTGGT TTGTCAGCCT AACAGCGTGA 600 

AGCCTCATAC GAAATTTAAG TCAGCTGTTT ACGTTCTTCA GAAAGAAGAA 650

GGCGGACGTC ATAAGCCTTT CTTCAGCGGA TACAGACCTC AGTTCTTCTT 700 

CCGTACTACA GACGTGACAG GAGTCGTAAC TCTTCCTGAA GGAACTGAAA 750 

TGGTAATGCC TGGAGATAAC GTTGAGCTTG ATGTTGAGCT CATTGGAACA 800 

GTTGCTCTTG AAGAAGGAAT GAGATTTGCA A 831 



2) INFORMATION FOR SEQ ID NO: 21 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia psittaci 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

TGGAGCGATT CTCGTTGTTT CCGCTACTGA CGGTGCGATG CCTCAGACCA 50 

AAGAACATAT TCTTTTGGCG AGACAGGTTG GTGTTCCTTA CATCGTTGTT 100 

TTCCTTAACA AAATCGATAT GATTTCTCAA GAAGATGCTG AGCTCGTAGA 150

CTTAGTTGAA ATGGAATTGT CCGAACTTCT AGAAGAAAAA GGTTATAAAG 200 

GTTGCCCAAT TATCCGTGGT TCTGCTTTGA AAGCCTTAGA AGGTGATGCA 250 

AGCTACGTTG AAAAAATTCG CGAGTTAATG CAAGCAGTGG ATGATAACAT 300

CCCTACTCCA GAGCGTGAAG TTGATAAGCC TTTCTTAATG CCTATCGAAG 350

ACGTATTCTC TATTTCTGGT CGTGGTACTG TGGTCACAGG ACGTATCGAG 400

CGTGGAATCG TTAAAGTGGG TGATAAAGTA CAGATTGTTG GTTTAAGAGA 450

TACTAGAGAG ACAATTGTTA CCGGTGTGGA AATGTTCAGA AAAGAACTTC 500 

CAGAAGGTCA AGCAGGGGAA AACGTTGGTT TGCTCCTCAG AGGTATCGGT 550 

AAGAATGACG TTGAACGTGG TATGGTTATC TGCCAACCTA ATAGCGTGAA 600 

ATCTCACACA CAATTTAAAG GTGCTGTCTA CATTCTACAA AAAGAAGAGG 650

GTGGACGTCA TAAACCTTTC TTTACCGGAT ACAGACCTCA GTTCTTCTTC 700 

CGTACAACAG ATGTTACAGG TGTTGTAACT CTCCCAGAAG GTACAGAGAT 750 

GGTTATGCCA GGCGATAACG TTGAATTCGA AGTTCAATTA ATTAGCCCAG 800 

TAGCTCTAGA AGAAGGTATG AGATTT 826 




2) INFORMATION FOR SEQ ID NO: 22 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 

(B) STRAIN: LGV 12 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 



GGGGCTATTC TAGTAGTTTC TGCAACAGAC GGAGCTATGC CTCAAACTAA 50 

AGAGCATATT CTTTTGGCAA GACAAGTTGG GGTTCCTTAC ATCGTTGTTT 100

TTCTCAATAA AATTGACATG ATTTCCGAAG AAGACGCTGA ATTGGTCGAC 150 

TTGGTTGAGA TGGAGTTGGC TGAGCTTCTT GAAGAGAAAG GATACAAAGG 200 

GTGTCCAATC ATCAGAGGTT CTGCTCTGAA AGCTTTGGAA GGGGATGCTG 250 

CATACATAGA GAAAGTTCGA GAGCTAATGC AAGCCGTCGA TGATAATATC 300 

CCTACTCCAG AAAGAGAAAT TGACAAGCCT TTCTTAATGC CCATTGAGGA 350

CGTGTTCTCT ATCTCCGGAC GAGGAACTGT AGTAACTGGA CGTATTGAGC 400 

GTGGAATTGT TAAAGTTTCC GATAAAGTTC AGTTGGTCGG TCTTAGAGAT 450 

ACTAAAGAAA CGATTGTTAC TGGGGTTGAA ATGTTCAGAA AAGAACTCCC 500 

AGAAGGTCGT GCAGGAGAGA ATGTTGGATT GCTCCTCAGA GGTATTGGTA 550 

AGAACGATGT GGAAAGAGGA ATGGTTGTTT GCTTGCCAAA CAGTGTTAAA 600

CCTCATACAC GGTTTAAGTG TGCTGTTTAC GTTCTGCAAA AAGAAGAAGG 650 

TGGACGACAT AAGCCTTTCT TCACAGGATA TAGACCTCAA TTCTTCTTCC 700 

GTACAACAGA CGTTACAGGT GTGGTAACTC TGCCTGAGGG AGTTGAGATG 750 

GTCATGCCTG GGGATAACGT TGAGTTTGAA GTGCAGTTGA TTAGCCCTGT 800 

GGCTTTAGAA GAAGGTATGA GA 822

2) INFORMATION FOR SEQ ID NO: 23

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Chryseobacterium meningosepticum

(B) STRAIN: CDC B7681 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

CGGAGCTATC TTAGTATGTG CTGCTACAGA TGGTCCAATG CCTCAAACTA 50

GAGAACACAT CCTACTTTGC CGTCAGGTAA ACGTACCTAG AATTGTTGTG 100 

TTCATGAACA AAGTTGACAT GGTAGATGAT CCAGAATTGT TAGAGCTTGT 150 

TGAGCTTGAA CTTAGAGATC TATTATCTAC TTACGAATAT GATGGTGATA 200 

ACTCTCCAGT AATTCAAGGT TCTGCTCTTG GTGCTCTTAA CGGTGATGCT 250 

AAGTGGGTAG CTACTGTAGA AGCTCTAATG GATGCTGTTG ATACTTGGAT 300

CGAGCAACCA GTAAGAGATT CTGATAAGCC ATTCCTTATG CCAATCGAAG 350 

ACGTATTCTC TATTACAGGT AGAGGTACTG TAGCAACTGG TAGAATCGAG 400

GCTGGTGTAA TCAACACAGG TGATCCTGTT GACATCGTAG GTATGGGTGA 450 

CGAGAAGTTA ACTTCTACTA TTACAGGTGT TGAGATGTTT AGAAAAATCC 500 

TAGACAGAGG TGAAGCTGGT GATAACGTAG GTCTATTGTT GAGAGGTATT 550

GAAAAGACTG ACATCAAGAG AGGTATGGTT ATCGCTAAGA AAGATTCAGT 600 

TAAGCCACAC AAGAAATTCA AAGCTGAGGT TTATATCCTT TCTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCCACAACA AATACCGTCC TCAGTTCTAT 700 

GTAAGAACTA CTGACGTTAC AGGTGAAATC TTCTTACCAG AAGGTGTAGA 750 

AATGGTAATG CCTGGTGATA ACTTAACTAT CACTGTAGAA TTGTTACAAC 800 

CAATCGCTCT TAACGAGGGT CTTAGATTCG CGATC 835 



2) INFORMATION FOR SEQ ID NO: 24 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter amalonaticus 

(B) STRAIN: ATCC 25405 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 



CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ACACCCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCC GGCTTCCTGG ATTCTTACAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTGCCAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCGGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCWCCATCAA 600 

GCCGCACACC ATGTTCGAAT CYGAAGTGTA CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTTGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGG 816 



2) INFORMATION FOR SEQ ID NO: 25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter braakii 

(B) STRAIN: ATCC 43162 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 

CGCGATCCTG GTTGTTGCTG CAACTGACGG CCCGATGCCG CAGACTCGTG 50 

AGCACATCCT GCTGGGTCGY CAGGTAGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTAGA 150 

AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGATTTCCCG GGCGACGACA 200 

CGCCGATCGT TCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGAWGCAGAG 250 

TGGGAAGCGA AAATCATCGA ACTGGCTGGC TTCCTGGATT CTTACATCCC 300 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCT ATCGAAGACG 350 

TATTCTCCAT CTCTGGTCGT GGTACCGTTG TTACCGGTCG TGTAGAGCGC 400 

GGTATCATCA AAGTTGGTGA AGAAGTTGAA ATCGTTGGTA TCAARGACAC 450 

TGCTAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAGCGT 550 

GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCT CTATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT TCTGTCCAAA GACGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AARGGCTACC GTCCGCAGTT CTACTTCCGT 700 

ACTACTGACG TGACTGGTAC CATCGAACTG CCGGAAGGCG TTGAGATGGT 750 

AATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CACCCAATCG 800 

CGATGGACGA CGGTCTGCGT TTCGC 825 

2) INFORMATION FOR SEQ ID NO: 26 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter koseri 

(B) STRAIN: ATCC 27156 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAGATGGAA GTGCGTGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ACACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAMGCT 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTACCTGG ATTCTTACAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATYGTTG GTATCAAAGA 450 

GACTGCGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GYTCCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATYCTGTCY AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA 830 

2) INFORMATION FOR SEQ ID NO: 27 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter farmeri 

(B) STRAIN: ATCC 51112 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACTCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTAGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTAGA 150 

GATGGAAGTT CGTGAACTGC TGTCTCAGTA CGATTTCCCG GGCGACGACA 200 

CGCCGATCGT TCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCAGAG 250 

TGGGAAGCGA AAATCATCGA ACTGGCAGGC TTCCTGGATT CTTACATCCC 300 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 350 

TATTCTCCAT CTCTGGTCGT GGTACCGTTG TTACCGGTCG TGTAGAGCGC 400 

GGTATCATCA AAGTGGGTGA AGAAGTTGAA ATCGTTGGTA TCAAAGAGAC 450 

TGCCAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC TGGTGAGAAC GTAGGTGTTC TGCTGCGTGG TATCAAACGT 550 

GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCW CCATCAAGCC 600 

RCACACTATG TTCGAATCTG AAGTGTACAT TCTGTCCAAA GACGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCGCAGTT CTACTTCCGT 700 

ACGACTGACG TGACTGGCAC CATCGAACTG CCGGAAGGTG TTGAGATGGT 750 

TATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CACCCGATCG 800 

CGATGGACGA CGGTCTGCGT TTCGCAA 827 

2) INFORMATION FOR SEQ ID NO: 28 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter freundii 

(B) STRAIN: ATCC 8090 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGATTT CCCGGGCGAC GACACTCCGA 200 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGAAGC AGAGTGGGAA 250 

GCGAAAATCA TCGAACTGGC TGGCTTCCTG GATTCTTACA TCCCAGAACC 300 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCTATCGAA GACGTATTCT 350 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 400 

ATCAAAGTTG GTGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTGCTAA 450 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 

GTGCTGGTGA GAACGTTGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCTCTATCA AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ACATTCTGTC CAAAGACGAA GGCGGCCGTC 650 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACTACT 700 

GACGTGACTG GTACCATCGA ACTGCCGGAA GGCGTAGAGA TGGTAATGCC 750 

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCA ATCGCGA 797 



2) INFORMATION FOR SEQ ID NO: 29 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter sedlakii 

(B) STRAIN: ATCC 51115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 



CGGCGCGATC CTGGTTGTTG CCGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAGATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ACACGCCGAT CGTTCGTGGT TCAGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTGCGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCGGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCGAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA TATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTC 826 



2) INFORMATION FOR SEQ ID NO: 30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter werkmanii 

(B) STRAIN: ATCC 51114 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 


GCGATCCTGG TTGTTGCTGC GACTGACGGC CCGATGCCGC AGACTCGTGA 50 

GCACATCCTG CTGGGTCGTC AGGTAGGCGT TCCGTACATC ATCGTGTTCC 100 

TGAACAAATG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTAGAA 150 

ATGGAAGTTC GTGAACTTCT GTCTCAGTAC GATTTCCCGG GCGACGACAC 200 

TCCGATCGTT CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC GAAGCAGAGT 250 

GGGAAGCGAA AATCATCGAA CTGGCTGGCT TTCTGGATTC TTACATCCCG 300 

GAACCAGAGC GTGCGATTGA CAAGCCGTTC CTGCTRCCTA TCGAAGACGT 350 

ATTCTCCATC TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAGCGCG 400 

GTATCATCAA AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT CAAAGACACC 450 

GCTAAGTCTA CCTGTACCGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA 500 

AGGCCGTGCT GGTGAGAACG TTGGTGTTCT GCTGCGTGGT ATCAAACGTG 550 

AAGAAATCGA ACGTGGTCAG GTACTGGCTA AGCCGGGCTC TATCAAGCCG 600 

CACACCAAGT TCGAATCTGA AGTGTACATC CTGTCCAAAG ACGAAGGCGG 650 

CCGTCATACT CCGTTCTTCA AAGGCTACCG TCCGCAGTTC TACTTCCGTA 700 

CTACTGACGT GACTGGTACC ATCGAACTGC CGGAAGGCGT AGAGATGGTA 750 

ATGCCGGGCG ACAACATYAA AATGGTTGTT ACYCTGATCC ACCCGATCGC 800 

GATGGACGAC GGTCTGCGTT TCG 823 



2) INFORMATION FOR SEQ ID NO: 31 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter youngae 

(B) STRAIN: ATCC 29935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CGGGCGACGA 200 

TACGCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGAAGCAG 250 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATC 300 

CCGGAACCAG AACGTGCTAT CGATAAGCCG TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACTGGT CGTGTAGAAC 400 

GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTGCCAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCTATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTACTGA CGTGACGGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCAAT 800 

CGCGATGGAT GACGGTCTGC GTTTCG 826 



2) INFORMATION FOR SEQ ID NO: 32 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: ATCC 13124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 



CGGAGCTATA TTAGTTTGTT CAGCAGCTGA TGGTCCAATG CCTCAAACAA 50 

GAGAGCACAT CTTATTATCA TCAAGAGTTG GAGTTGACCA CATCGTAGTA 100 

TTCTTAAACA AAGCAGATAT GGTTGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTAGAGAGT TATTAAGCGA GTACAACTTC CCAGGAGACG 200 

AYATTCCAGT AATCAARGGA TCAGCTTTAG TAGCATTAGA AAACCCAACT 250 

GACGAAGCTG CAACAGCTTG TATCAGAGAG TTAATGGATG CTGTAGATAG 300 

CTACATCCCA ACACCAGAAA GAGCAACAGA TAAGCCATTC TTAATGCCAG 350 

TAGAGGACGT ATTCACAATC ACTGGTAGAG GAACAGTTGC AACAGGAAGA 400 

GTTGAAAGAG GAGTTCTACA TGTAGGAGAC GAAGTAGAAG TAATCGGATT 450 

AACTGAAGAA AGAAGAAAAA CTGTTGTAAC AGGAATCGAA ATGTTCAGAA 500 

AGTTATTAGA TGAAGCACAA GCTGGAGATA ACATCGGAGC ATTATTAAGA 550 

GGTATCCAAA GAACTGAYAT CGAAAGAGGT CAAGTTTTAG CTCAAGTTGG 600 

AACAATCAAC CCACACAAAA AATTCGTAGG TCAAGTATAC GTACTTAAAA 650 

AAGAAGAAGG TGGAAGACAT ACTCCATTCT TCGATGGATA CAGACCACAA 700 

TTCTACTTCA GAACAACAGA CGTTACAGGA TCAATCAAAT TACCAGAAGG 750 

AATGGAAATG GTTATGCCTG GAGACCACAT CGACATGGAA GTTGAATTAA 800 

TCACAGAAAT CGCTATGGAY GAAGGATTAA GATTCGCTAT C 841 



2) INFORMATION FOR SEQ ID NO: 33 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Comamonas acidovorans 

(B) STRAIN: ATCC 15668 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

CGGCGCCATC CTGGTGTGCT CGGCCGCTGA CGGCCCCATG CCCCAGACCC 50 

GCGAGCACAT CCTGCTGGCC CGTCAGGTGG GCGTGCCCTA CATCATCGTG 100 

TTCCTGAACA AGTGCGACAT GGTGGACGAC GAAGAGCTGC TGGAACTGGT 150 

CGAAATGGAA GTGCGCGAGC TGCTTGCCAA GTACGACTTC CCCGGCGACG 200 

ACACCCCCAT CATCCGCGGC TCGGCCAAGC TGGCCCTGGA AGGCGACCAG 250 

TCCGACAAGG GCGAACCTGC CATCCTGCGC CTGGCTGAAG CACTGGACTC 300 

CTACATCCCC ACGCCCGAGC GCGCTGTGGA CGGCGCCTTT GCAATGCCCG 350 

TGGAAGACGT GTTCTCGATC TCTGGCCGTG GCACCGTGGT GACTGGCCGT 400 

ATCGAGCGCG GCATCATCAA GGTCGGCGAA GAAATCGAAA TCGTCGGTAT 450 

CCGCGACACC CAGAAGACCA TCGTCACCGG CGTGGAAATG TTCCGCAAGC 500 

TGCTGGACCA AGGTCAAGCT GGCGACAACG TGGGTCTGCT GCTGCGCGGC 550 

ACCAAGCGTG AAGACGTGGA ACGCGGCCAA GTGCTGTGCA AGCCCGGCTC 600 

CATCAAGCCC CACACCCACT TCACGGCTGA GGTGTACGTG CTGTCCAAGG 650 

ACGAAGGTGG TCGCCACACT CCGTTCTTCA ACAACTACCG TCCCCAGTTC 700 

TATTTCCGTA CGACCGACGT GACCGGCTCC ATCGAGCTGC CCGCCGACAA 750 

GGAAATGGTG ATGCCTGGCG ACAACGTGTC GATCACCGTC AAGCTGATCG 800 

CCCCCATCGC CATGGAAGAA GG 822 



2) INFORMATION FOR SEQ ID NO: 34 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Corynebacterium bovis 

(B) STRAIN: ATCC 7715 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

GCCGCAGACC CGTGAGCACG TCCTCCTGGC CCGTCAGGTC GGTGTGCCCT 50 

ACATCCTCGT CGCCCTCAAC AAGTGCGACA TGGTCGACGA CGAGGACCTC 100 

ATCGAGCTCG TCGAGATGGA GGTCCGTGAG CTCCTCGCCG AGCAGGACTA 150 

CGACGAGGAC GCCCCGATCA TCCACATCTC CGCCCTCAAG GCCCTCGAGG 200 

GTGACCCGGA GTGGACGCAG CGCATCGTCG ACCTCATGAA GGCCTGCGAC 250 

GACGCCATCC CGGATCCGGA GCGCGAGACG GACAAGCCGT TCCTCATGCC 300 

GATCGAGGAC ATCTTCACGA TCACCGGCCG CGGCACCGTC GTCACGGGCC 350 

GTGTCGAGCG TGGCATCCTC AACGTCAACG AGGAGGTCGA GATCCTGGGT 400 

ATCTGCGAGA ACTCCCAGAA GACGACCGTC ACCTCCATCG AGATGTTCAA 450 

CAAGTTCCTC GACACGGCCG AGGCCGGCGA CAACGCCGCC CTGCTGCTCC 500 

GTGGCCTGAA GCGCGAGGAC GTCGAGCGTG GCCAGATCGT GGCCAAGCCG 550 

GGCGCCTACA CGCCGCACAC CGAGTTCGAG GGCTCCGTGT ACATCCTCTC 600 

CAAGGACGAG GGTGGCCGCC ACACGCCGTT CTTCGACAAC TACCGTCCGC 650 

AGTTCTACTT CCGGACGACC GACGTCACCG GCGTCGTCAA GCTGCCGGAG 700 

GG 702 



2) INFORMATION FOR SEQ ID NO: 35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 689 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium cervicis 
(B) STRAIN: NCTC 10604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

GGCTCAGACC CGCGAGCACG TTCTGCTTGC TCGCCAGGTT GGCGTTCCGA 50 

CGATCCTGGT TGCCCTCAAC AAGGCCGATA TGGTCGACGA TGAGGAAATG 100 

CTGGAGCTCG TTGAGGAAGA GTGCCGCGAC CTGCTCGAGT CCCAGGACTT 150 

CGATCGTGAC GCCCCGATCA TCCAGGTTTC CGCGCTGAAG GCTCTCGAAG 200 

GTGATCCGCA GTGGGTTGCT AAGGTCGAGG AGCTCATGGA GGCAGTCGAC 250 

ACCTTCGTGC CGACTCCTGA GCGCGACATG GACAAGCCGT TCCTCATGCC 300 

GATCGAAGAC GTCTTCACCA TCACCGGCCG TGGCACCGTT GTTACCGGTC 350 

GTGTTGAGCG TGGCAAGCTC CCGATCAACT CTGAGGTTGA AATCCTCGGT 400 

ATCCGCGAAC CGCAGAAGAC CACCGTTACC GGTATCGAGA TGTTCCACAA 450 

GTCCATGGAT GAAGCATGGG CAGGCGAGAA CTGTGGTCTC CTCCTGCGTG 500 

GCACCAAGCG CGATGAGGTT GAGCGCGGTC AGGTCGTTGC CGTTCCCGGT 550 

TCGATCACCC CGCACACCAA CTTCACCGGA CAGGTCTACA TCCTCAAGAA 600 

GGAAGAAGGC GGTCGTCACA ACCCGTTCTT CTCGAACTAC CGTCCGCAGT 650 

tcYaItttccg" caUcaoggac" gtgaccggcg" tcatcaccc 6«y 



2) INFORMATION FOR SEQ ID NO: 36 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium flavescens 

(B) STRAIN: ATCC 10340 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 

GGTTGTTGCT GCAACCGATG GTCCTATGCC GCAGACCCGC GAGCACGTTC 50 

TTCTGGCTCG CCAGGTTGGC GTTCCTTACA TCCTCGTTGC TCTTAACAAG 100 

TGCGACATGG TTGATGATGA GGAAATCATC GAGCTCGTTG AGATGGAAAT 150 

CCGCGAACTG CTCGCTGAGC AGGACTACGA CGAGGATGCC CCCATCATCC 200 

ACATCTCCGC TCTCAAGGCT CTTGAGGGTG ACGAGAAGTG GGTACAGGCC 250 

ATCGTCGACC TCATGCAGGC CTGCGATGAC TCCATTCCGG ATCCGGAGCG 300 

CGAGACCGAC AAGCCCTTCC TCATGCCTAT CGAGGACATC TTCACCATCA 350 

CCGGCCGCGG TACCGTTGTT ACCGGCCGTG TTGAGCGTGG CGTTTTGAAG 400 

GTCAACGAGG ATGTTGAGAT CATCGGCATC AAGGAGAAGT CCATCTCCAC 450 

CACCGTTACC GGTATCGAAA TGTTCCGCAA GATGATGGAC TACACCGAGG 500 

CTGGCGACAA CTGTGGTCTG CTTCTGCGTG GTACCAAGCG TGAAGAGGTC 550 

GAGCGCGGCC AGGTTGTTAT CAAGCCGGGC GCCTACACCC CCCACACCAA 600 

GTTCGAGGGT TCCGTCTACG TCCTCAAGAA GGAAGAGGGC GGCCGCCACA 650 

CCCCGTTCAT GGACAACTAC CGTCCGCAGT TCTACTTCCG TACCACTGAC 700 

GTGACCGGCG TTGTTCACCT GCCTGAGGGC ACCGAGATGG TCATGCCTGG 750 

CGACAACGTT GATATGACCG TTGAGCTCAT CCAGCCCGTC GCTAGGATGA 800 

GGGC 804 



2) INFORMATION FOR SEQ ID NO: 37 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium kutscheri 

(B) STRAIN: ATCC 15677 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 



TGCCTCAGAC CCGTGAGCAC GTTCTTCTTG CTCGCCAGGT TGGCGTTCCT 50 

TACATCCTCG TTGCTCTTAA CAAGTGCGAC ATGGTTGACG ATGAGGAAAT 100 

CATCGAGCTC GTTGAGATGG AAGTTCGCGA GCTTCTTGCT GAGCAGGAGT 150 

ACGATGAAGA GGCTCCAATC ATCCACATCT CTGCTTTGAA GGCTCTTGAG 200 

GGCGACGAGA ~A^ GACCTCATGC AGGCTTGTGA 250 

TGACTCCATC CCAGATCCAG AGCGTGAGAC CGACAAGCCA TTCCTCATGC 300 

CTATCGAGGA TATCTTCACC ATCACCGGTC GTGGCACCGT TGTTACCGGT 350 

CGTGTTGAGC GCGGTTCCTT GAAGGTGAAT GAGGACGTCG AGATCATCGG 400 

CATCAAGGAG AAGTCCACCA CTACTACCGT TACCGGTATC GAAATGTTCC 450 

GTAAGCTTCT TGATTACACC GAAGCTGGCG ATAACTGTGG TCTGCTTCTT 500 

CGTGGTATCA AGCGCGAAGA CGTTGAGCGT GGTCAGGTTG TTGTTAAGCC 550 

AGGCGCTTAC ACACCTCACA CCGAGTTCGA GGGCTCTGTT TACGTTCTTT 600 

CCAAGGACGA GGGCGGCCGC CACACCCCAT TCTTCGACAA CTACCGTCCA 650 

CAGTTCTACT TCCGCACCAC TGACGTTACC GGTGTTGTGA AG 692 



2) INFORMATION FOR SEQ ID NO: 38 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium minutissimum 

(B) STRAIN: ATCC 23348 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 

CCTGGTTGTT GCTGCAACCG ATGGCCCGAT GCCGCAGACC CGCGAGCACG 50 

TTCTTCTGGC CCGCCAGGTT GGCGTTCCGT ACATCCTCGT TGCACTGAAC 100 

AAGTGTGACA TGGTTGACGA TGAGGAAATC ATCGAGCTCG TTGAGATGGA 150 

GATCCGTGAG CTGCTCGCTG AGCAGGACTA CGACGAGGAA GCTCCGATCG 200 

TTCACATCTC CGCTCTGAAG GCTCTTGAGG GCGACGAGAA GTGGGCACAG 250 

TCCATCGTTG ACCTGATGCA GGCTTGCGAT GACTCCATCC CGGATCCGGA 300 

GCGCGAGCTG GACAAGCCGT TCCTGATGCC GATCGAGGAC ATCTTCACCA 350 

TTACCGGCCG CGGTACCGTT GTTACCGGCC GTGTTGAGCG TGGCTCCCTG 400 

AACGTTAACG AGGACATCGA GATCATCGGT ATCAAGGACA AGTCCATGTC 450 

CACCACCGTT ACCGGTATCG AGATGTTCCG CAAGATGATG GACTACACCG 500 

AGGCTGGCGA CAACTGTGGT CTGCTTCTGC GTGGTACCAA GCGTGAAGAG 550 

GTTGAGCGTG GCCAGGTTTG CATCAAGCCG GGCGCTTACA CCCCGCACAC 600 

CAAGTTCGAG GGTTCCGTCT ACGTCCTGAA GAAGGAAGAG GGCGGCCGCC 650 

ACACCCCGTT CATGGACAAC TACCGTCCGC AGTTCTACTT CCGCACCACC 700 

GACGTCACCG GTGTCATCAA GCTGCCGGAG GGCACCGAGA TGGTCATGCC 750 

GGGCGACAAC GTTGAGATGT CCGTAGAGCT GATCCAGCCG GTCGCTA 797 



2) INFORMATION FOR SEQ ID NO: 39 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium mycetoides 

(B) STRAIN: ATCC 21134 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 

GCCGCAGACC CGCGAGCACG TTCTTCTGGC CCGCCAGGTC GGCGTCCCCT 50 

ACATCCTCGT TGCGCTGAAC AAGTGCGACA TGGTTGATGA TGAGGAGATC 100 

ATCGAGCTCG TGGAGATGGA GGTCCGTGAG CTGCTCGGCG AGCAGGACTA 150 

CGACGAGGAC GCCCCCATCA TCCACATCTC CGCTCTGAAG GCTCTCGAGG 200 

GCGACGAGAA GTGGGTTCAG TCCGTGCTCG ACCTCATGCA GGCGTGCGAC 250 

GACTCCATCC CGGATCCGGT CCGCGAGACC GACCGCGACT TCCTGATGCC 300 

GATCGAGGAC ATCTTCACCA TCTCCGGCCG CGGCACCGTG GTTACCGGTC 350 

GTGTGGAGCG CGGCGTGCTC AACCTCAACG ACGAGGTCGA GATCATCGGC 400 

ATCCGCGACA AGTCCCAGAA GACCACCGTC ACCTCCATCG AGATGTTCAA 450 

CAAGCTGCTC GATACCGCTG AGGCAGGCGA CAACGCGGCT CTGCTGCTCC 500 

GCGGTCTGAA GCGCGAGGAC GTCGAGCGTG GCCAGGTTGT CATCAAGCCG 550 

GGCGCCTACA CCCCGCACAC CAAGTTCGAG GGTTCCGTCT ACGTCCTGTC 600 

CAAGGACGAG GGCGGCCGCC ACACCCCGTT CTTCGACAAC TACCGTCCGC 650 

AGTTCTACTT CCGCACCACC GACGTGACCG GTGTTGTGAA GCTGCCGGAG 700 

GG 702 

2) INFORMATION FOR SEQ ID NO: 40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 674 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Corynebacterium pseudogenitalium 

(B) STRAIN: ATCC 33038 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 

GCTCGCCAGG TTGGCGTTCC TTACATCCTC GTTGCGCTGA ACAAGTGCGA 50 

CATGGTTGAT GATGAGGAAA TCATCGAGCT CGTTGAGATG GAGATCCGTG 100 

AGCTGCTCGC AGAGCAGGAT TACGATGAGG AAGCTCCTAT CGTTCACATC 150 

TCCGCTCTGA AGGCCCTCGA GGGCGATGAC AAGTGGGTAC AGTCCGTCGT 200 

TGATCTGATG GAAGCCTGCG ACAACTCCAT CCCGGATCCG GAGCGCGCTA 250 

CCGACCAGCC GTTCCTGATG CCTATCGAGG ACATCTTCAC CATTACCGGC 300 

CGCGGTACCG TTGTTACCGG CCGTGTTGAG CGTGGCCGTC TGAACGTCAA 350 

CGAGGACGTT GAGATCATCG GTATCCAGGA GAAGTCCCAG ACCACCACCG 400 

TTACCGGTAT CGAGATGTTC CGCAAGATGA TGGACTACAC CGAGGCTGGC 450 

GACAACTGTG GTCTGCTTCT GCGTGGTACC AAGCGTGAGG ACGTTGAGCG 500 

TGGCCAGGTT GTTATCAAGC CGGGCGCTTA CACCCCGCAC ACCAAGTTCG 550 

AGGGCTCCGT CTACGTCCTG AAGAAGGAAG AGGGCGGCCG CCACACCCCG 600 

TTCATGAACA ACTACCGTCC GCAGTTCTAC TTCCGTACCA CGGACGTTAC 650 

CGGTGTTGTT CACCTGCCAG AGGG 674 






2) INFORMATION FOR SEQ ID NO: 41 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 694 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium renale 

(B) STRAIN: ATCC 19412 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 



TGCCTCAGAC CCGTGAGCAC GTTCTGCTTG CTCGTCAGGT CGGCGTTCCT 50 

TACATCCTCG TTGCACTGAA CAAGTGCGAC ATGGTCGACG ACGAAGAAAT 100 

CATCGAGCTC GTCGAGATGG AAATCCGTGA ACTGCTCGCA GAGCAGGACT 150 

ACGATGAGGA AGCTCCTATC GTTCACATCT CCGCTCTGGG CGCCCTGAAC 200 

GGCGAGCAGA AGTGGGTTGA CTCCATCGTC GAACTGATGG AAGCTTGCGA 250 

CAACTCCATC CCAGACCCAG TTCGCGACAT CGACCACCCA TTCCTGATGC 300 

CTATCGAGGA CATCTTCACC ATTACCGGTC GCGGTACCGT TGTTACCGGC 350 

CGTGTCGAGC GTGGCCGTCT CAACGTCAAC GAAGAAGTTG AGATCATCGG 400 

TATCAAGGAC AAGTCCCAGA AGACCACCGT CACCGGTATC GAGATGTTCC 450 

GCAAGATGCT GGACTACACC GAAGCTGGCG ACAACTGTGG TCTGCTGCTC 500 

CGCGGCATCG GCCGTGAGGA TGTCGAGCGT GGCCAGGTTA TCATCAAGCC 550 

AGGCGCTTAC ACCCCTCACT CTGAGTTCGA GGGCTCTGTC TACGTCCTGT 600 

CCAAGGACGA GGGTGGCCGC CACACCCCAT TCTTCGACAA CTACCGTCCA 650 

CAGTTCTACT TCCGCACCAC CGACGTGACC GGCGTTGTGC ACCT 694 



2) INFORMATION FOR SEQ ID NO: 42 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 687 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium ulcerans 

(B) STRAIN: NCTC 8665 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 



GCCGCAGACC CGCGAGCACG TTCTGCTGGC TCGCCAGGTT GGCGTTCCKT 50 

ACATCCTSGT TGCACTGAAC AAGTGCGACA TGGTTGACGA TGAGGARCTC 100 

CTSGAGCTCG TCGAGATGGA GGTCCGCGAG CTGCTGGCTG AGCAGGACTA 150 

CGACGAGGAA GCTCCGRTCG TTCACATCTC CGCWCTGAAC GCCCTGGACG 200 

GCGACSAGAA GTGGGCTVAC TCCATCCTCG AGCTGATGCA GGCTTGCGAC 250 

GAGTCCATCC CGGATCCGGA GCGCGAGACC GACAAGCCGT TCCTGATGCC 300 

GATTGAGGAC ATCTTCACCA TTACCGGTCG CGGYACCGTT GTTACCGGCC 350 

GTGTTGAGCG TGGCDTCCTG AACGTSAACG ACGASGTTGA GATCATGGGY 400 

ATCCGGGAGA AGTCCCAGAA GACCACCGTY ACCKSCATCG AGATGTTCAA 450 

CAAGMTGMTG GACWCCGCAG AGGCTGGCGA CAACGCTGSW CTGCTGCTGC 500 

GTGGTMTSAA GCGTGAGGAC GTTGAGCGTG GCCAGATCAT CGYTAAGCCG 550 

GGCGCKTACA CCCCGCACAC CGAGTTCGAG GGCTCCGTCT ACGTCCTGTC 600 

CAAGGACGAG GGCGGCCGCC ACACCCCGTT CTTCGACAAC TACCGTCCGC 650 

AGTTCTACTT CCGCACCACC GACGTSACCG GTGTTGT 687 




2) INFORMATION FOR SEQ ID NO: 43 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium urealyticum 

(B) STRAIN: ATCC 43042 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

CTGGTTGTTG CTGCAACCGA TGGCCCGATG CCGCAGACCC GTGAGCACGT 50 
TCTGCTGGCT CGCCAGGTTG GCGTTCCGTA CATCCTCGTT GCACTGAACA 100 

AGTGCGACAT GGTTGACGAT GAGGAGCTCC TCGAGCTCGT CGAGATGGAG 150 

GTCCGCGAGC TTCTGGCTGA GCAGGACTAC GACGAGGAGG CTCCGGTCGT 200 

CCCGATCTCC GCACTGGGCG CCCTGGACGG CGATCAGAAG TGGGTCGACT 250 

CCATCCTCGA GCTCATGAAG GCTTGCGACG AGTCCATCCC GGACCCGGAG 300 

CGCGAGACCG ACAAGCCGTT CCTGATGCCG GTTGAGGACA TCTTCACCAT 350 

TACCGGTCGC GGCACCGTCG TTACCGGCCG TGTTGAGCGT GGCGTCCTGA 400 

ACCTGAACGA CGAGGTCGAG ATCCTGGGCA TCCGCGAGAA GTCCACCAAG 450 

ACCACCGTCA CCTCCATCGA GATGTTCAAC AAGCTGCTGG ACACCGCAGA 500 

GGCTGGCGAC AACGCTGCAC TGCTGCTGCG TGGTCTGAAG CGTGAGGACG 550 

TCGAGCGAGG CCAGATCATC GCTAAGCCGG GCGCTTACAC CCCGCACACC 600 

GAGTTCGAGG GCTCCGTCTA CGTCCTGTCC AAGGACGAGG GCGGCCGTCA 650 

CACCCCGTTC TTCGACAACT ACCGTCCGCA GTTCTACTTC CGTACCACCG 700 

ACGTCACCGG TGTCGTTACC CTGCCAGAGG GCACCGACAT GGTCATGCCG 750 

GGCGACAACG TTGAGATGAG CGTCAAGC 778 



2) INFORMATION FOR SEQ ID NO: 44 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 703 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium xerosis 
(B) STRAIN: ATCC 373 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 



CGCAGACCCG TGAGCACGTC CTCCTGGCCC GCCAGGTCGG CGTCCCCTAC 50 

ATCCTCGTCG CCCTGAACAA GTGCGACATG GTCGACGATG AGGAGATCAT 100 

CGAGCTCGTG GAGATGGAGG TGCGTGAGCT TCTCGCCGAG CAGGACTACG 150 

ACGAGGAGGC CCCGATCGTG CACATCTCCG CCCTGGGCGC CCTCAATGGC 200 

GAAGAGAAGT GGGTCGACTC CATCGTCGAG CTCATGAACG CCGTCGACGA 250 

GAACGTTCCG GACCCGGTCC GCGAGACCGA CAAGCCGTTC CTGATGCCCG 300 

TCGAGGACAT CTTCACCATC ACCGGCCGCG GCACCGTCGC CACCGGTCGC 350 

GTGGAGCGCG GCACCCTGAA GGTCAACGAC GAGGTCGAGA TCCTGGGCAT 400 

CCAGGAGAAG TCCCAGACCA CCACCGTCAC CGGCATCGAG ATGTTCCGCA 450 

AGCTGCTGGA CTCCGCCGAG GCCGGCGACA ACTGTGGCCT GCTGCTCCGC 500 

GGCATCAAGC GCGAGGACAT CGAGCGCGGC CAGATCATCG CGAAGCCGGG 550 

CGCCTACACC CCGCACACCG AGTTCGAGGG CTCCGTCTAC ATCCTGGCCA 600 

AGGACGAGGG CGGCCGCCAC ACCCCGTTCT TCGACAACTA CCGTCCGCAG 650 

TTCTACTTCC GCACCACCGA CGTCACCGGC GTCGTGAAGC TGCCGGAGGG 700 

CAC 703 






2) INFORMATION FOR SEQ ID NO: 45 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coxiella burnetii 

(B) STRAIN: Nine Mile phase II 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 



GGAGCGATAT TGGTGGTGAG CGCAGCGGAC GGCCCGATGC CGCAAACGCG 50 

GGAACACATT GTATTGGCGA AGCAAGTGGG TGTTCCGAAC ATAGTGGTTT 100 

ACTTGAACAA AGCGGACATG GTGGATGACA AAGAGCTGTT GGAATTAGTG 150 

GAAATGGAAG TGAGGGATTT ATTGAACAGT TATGATTTCC CTGGGGATGA 200 

GACGCCGATA ATAGTGGGGT CAGCGTTAAA GGCGTTAGAA GGTGACAAGA 250 

GTGAGGTTGG GGAGCCATCG ATAATCAAAT TAGTGGAAAC GATGGACACG 300 

TACTTCCCGC AGCCGGAGCG AGCGATAGAC AAACCGTTTT TAATGCCGAT 350 

CGAAGATGTG TTTTCGATAT CGGGCCGAGG GACGGTGGTG ACGGGACGCG 400 

TAGAGCGAGG GATCATCAAA GTGGGCGACG AGATAGAGAT TGTGGGGATC 450 

AAGGACACGA CGAAGACGAC GTGCACGGGC GTTGAGATGT TTCGCAAATT 500 

ATTGGATGAA GGTCAAGCGG GTGACAACGT AGGAATTTTA TTGAGAGGGA 550 

CGAAACGCGA AGAAGTGGAG CGTGGTCAAG TATTGGCGAA ACCGGGATCG 600 

ATCACGCCAC ACAAGAAATT TGAGGCGGAG ATTTATGTGT TGTCGAAGGA 650 

AGAAGGGGGA CGCCACACAC CGTTTTTACA AGGCTATCGA CCGCAATTTT 700 

ATTTCCGCAC GACGGACGTG ACGGGCCAGT TATTGAGTTT ACCGGAGGGG 750 

ATAGAGATGG TGATGCCGGG AGATAACGTG AAAGTGACGG TTGAATTGAT 800 

TGCGCCGGTA GCGATGGATG AAGGGCTACG AT 832 




2) INFORMATION FOR SEQ ID NO: 46 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwardsiella hoshinae 

(B) STRAIN: ATCC 33379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 

GGCGCTATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GCCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA GTGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAGATGGAAG TTCGCGAACT GCTGTCTCAG TACGATTTCC CGGGCGACGA 200 

TACGCCGGTA ATCCGCGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250 

AGTGGGAAGC GAAGATCATC GAACTGGCTG AAACGCTGGA CTCCTACATT 300 

CCGGAACCTG AGCGTGACAT CGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCA ATCTCTGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAGGTAGGC GACGAAGTTG AAATCGTAGG TATCAAGCCG 450 

ACCACCAAGA CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTACCAAGC 550 

GTGACGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCACT 600 

CCGCACACCA AGTTCGAATC AGAAGTGTAC ATCCTGAGCA AGGATGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGTTA CCGTCCGCAG TTCTACTTCC 700 

GTACCACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAGATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCCATGGAC GATGGT 816 

2) INFORMATION FOR SEQ ID NO: 47 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Edwardsiella tarda 

(B) STRAIN: ATCC 15947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGTTGGGTC GCCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA GTGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAGATGGAAG TTCGCGAACT GCTGTCTCAG TACGACTTCC CGGGCGACGA 200 

CACGCCGGTA ATCCGCGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250 

AGTGGGAAGC GAAGATCATC GAACTGGCTG AAACTCTGGA CTCCTACATC 300 

CCGGAACCTG AGCGTGACAT CGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCT ATCTCTGGCC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAGGTAGGC GACGAAGTTG AAATCGTTGG TATCAAGCCG 450 

ACCACCAAGA CCACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTACTAAGC 550

GTGACGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCACT 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATCCTGAGCA AGGATGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTACTGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAGATGGTT GTTACCCTGA TCCACCCGAT 800

CGCCATGGAC GATGGTCTGC G 821 



2) INFORMATION FOR SEQ ID NO: 48

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Eikenella corrodens 

(B) STRAIN: ATCC 23834 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 



CGGTGCCATC CTGGTGGTAT CCGCTGCTGA CGGCCCCATG CCTCAGACTC 50 

GCGAACACAT CCTGTTGGCT CGTCAGGTAG GTGTACCCTA CATCCTCGTA 100 

TTCATGAACA AATGCGACAT GGTAGATGAT GCCGAGCTGC TTGAGTTGGT 150 

TGAGATGGAA ATCCGCGACC TGCTCTCCAG CTATGACTTC CCTGGTGACG 200

ACTGCCCGAT CGTACAAGGT TCCGCTCTCA AAGCCCTCGA AGGCGATGCC 250

GGTTACAAAG AAAAAATCTT CGAACTAGCT GCTGCTTTGG ATAGCTACAT 300

CCCCACTCCT CAACGTGCTG TAGACAAACC CTTCCTGTTG CCGATCGAAG 350

ACGTATTCTC TATCTCCGGC CGTGGTACCG TAGTAACCGG TCGTGTAGAG 400

CGCGGCATCA TCAAAGTAGG TGAAGAGATC GAAATCGTTG GTCTGAAGCC 450

CACTCAGAAA ACTACCTGTA CTGGCGTGGA AATGTTCCGC AAACTGCTGG 500

ACGAAGGTCA GGCCGGTGAC AACGTAGGCG TACTGCTGCG CGGTACCAAA 550

CGTGAAGAAG TTGAGCGTGG TCAAGTATTG GCTAAACCCG GCACCATCAC 600

TCCGCACACC AAGTTCAAAG CCGAAGTATA CGTATTGAGC AAAGAAGAAG 650

GTGGTCGTCA CACCCCGTTC TTTGCCAACT ACCGTCCACA GTTCTACTTC 700

CGTACTACTG ACGTAACCGG TGCTGTAGAG CTGGAGCCTG GTGTAGAAAT 750

GGTTATGCCT GGTGAGAACG TAACCATCAC CGTAGAACTG ATTGCTCCGA 800

TTGCTATGGA AGAAGGTCTG CGCTTTGCGA 830


2) INFORMATION FOR SEQ ID NO: 49 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 808 bases
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterobacter aerogenes

(B) STRAIN: ATCC 13048

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAGATGGAAG TTCGTGAACT GCTGTCTCAG TACGATTTCC CGGGCGACGA 200

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATC 300 

CCRGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAC 450

ACCGCGAAAA CCACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGYGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CAGCATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATCCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700

GTACTACTGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGRTG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGG 808 




2) INFORMATION FOR SEQ ID NO: 50 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 828 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterobacter agglomerans 

(B) STRAIN: ATCC 27989 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 



CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGTGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ATACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAMGCW 250 

GAGTGGGAAG CGAAAATCAT CGARCTGGCT GGCCACCTGG ATACCTATAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGCGGTACCG TTGTTACCGG TCGTGTAGAG 400

CGCGGTATCA TYAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450

TACYGCGAAA TCAACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650

GCGGTCGTCA CACTCCGTTC TTCAAAGGCT ACCGTCCSCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTCGCA 828 

2) INFORMATION FOR SEQ ID NO: 51

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter amnigenus 
(B) STRAIN: ATCC 33072

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 

TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCTATG CCACAGACGC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGTGATG 200 

ACACTCCAAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAAGCA 250 

GAGTGGGAAG CTAAAATCGT TGAGCTGGCT GGCTACCTGG ATTCTTACAT 300 

CCCGGAACCA GAACGTGCTA TCGATAAGCC ATTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC TATCTCCGGC CGTGGTACTG TTGTAACCGG TCGTGTAGAG 400 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTGCTAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCAG GCTCAATCAA 600 

GCCGCACACC AAATTCGAAT CTGAAGTTTA TATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGACCGG CACCATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGCGACAACA TTCAGATGGT TGTTACCCTG ATCCACCCAA 800 

TCGCGATGGA TGACGGTCTG CGTTT 825



2) INFORMATION FOR SEQ ID NO: 52

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Enterobacter asburiae

(B) STRAIN: ATCC 35953 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCAATG CCTCAGACTC 50

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTT CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAGATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ATACTCCAAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300

CCCAGAACCA GAGCGTGCGA TTGACAAGCC ATTCCTGCTG CCAATCGAAG 350

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400

CGCGGTATCA TCAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTGCTAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550

CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCGAAGCCAG GCTCAATCAA 600 

GCCACACACC AAGTTCGAAT CTGAAGTGTA CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTTGAGAT 750 

GGTAATGCCA GGCGACAACA TCAAGATGGT TGTGACTCTG ATCCACCCAA 800

TCGCGATGGA CGACGGTCTG CG 822 



2) INFORMATION FOR SEQ ID NO: 53

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterobacter cancerogenus

(B) STRAIN: ATCC 35317



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 



CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCAATG CCTCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AGTGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCAGGCGACG 200 

ACACTCCAAT CGTTCGTGGT TCCGCGCTGA AAGCGCTGGA AGGCGAAGCT 250

GAGTGGGAAG CAAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300

CCCAGAACCA GAGCGTGCGA TTGACAAGCC ATTCCTGCTG CCAATCGAAG 350

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450

TACTGCKAAA TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550

CGCGAAGAAA TCGAACGTGG TCAGGTTCTG GCGAAGCCAG GCTCAATCAA 600

GCCACACACC AAGTTCGAAT CTGAAGTGTA CATCCTGTCC AAAGACGAAG 650

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTAGAGAT 750

GGTAATGCCA GGCGACAACA TCAAGATGGT TGTGACGCTG ATCCACCCAA 800

TCGCGATGGA CGACGGTCTG CGTTTC 826



2) INFORMATION FOR SEQ ID NO: 54 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 806 bases
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54

GATCCTGGTA GTAGCTGCGA CTGACGGCCC AATGCCTCAG ACTCGTGAGC 50 

ACATCCTGCT GGGTCGTCAG GTAGGCGTTC CTTACATCAT CGTGTTCCTG 100 

AACAAATGCG ACATGGTTGA TGACGAAGAG CTGCTGGAAC TGGTAGAGAT 150 

GGAAGTTCGT GAACTGCTGT CTCAGTACGA TTTCCCAGGC GACGATACCC 200

CAATCGTTCG TGGTTCTGCT CTGAAAGCGC TGGAAGGCGA CGCAGAGTGG 250 

GAAGMGAAAA TCATCGAACT GGCTGGCTAC CTGGATTCTT ACATCCCAGA 300 

ACCAGAGCGT GCGATTGAYA AGCCATTCCT GCTGCCAATC GAAGACGTAT 350 

TCTCCATCTC CGGTCGTGGT ACCGTTGTTA CCGGTCGTGT AGAGCGCGGT 400

ATCATCAAAG TGGGTGAAGA AGTTGAAATC GTTGGTATCA AAGAGACTGC 450

GAAGTCTACC TGTACTGGCG TTGAAATGTT CCGCAAACTG CTGGACGAAG 500 

GCCGTGCTGG TGMAACGTC TG^GTGGTAT ^AAACGTGAA 550

GAAATCGAAC GTGGTCAGGT TCTGGCGAAG CCAGGCTCAA TCAAGCCACA 600 

CACCAAGTTC GAATCTGAAG TGTACATCCT GTCCAAAGAC GAAGGCGGCC 650 

GTCATACTCC GTTCTTCAAA GGCTACCGTC CACAGTTCTA CTTCCGTACA 700

ACTGACGTGA CCGGTACCAT CGAACTGCCA GAAGGCGTAG AGGTGGTAAT 750 

GCCAGGCGAC AACATCAAGA TGGTTGTGAC TCTGATCCAC CCAATCGCGA 800 

TGGACG 806 




2) INFORMATION FOR SEQ ID NO: 55 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 826 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterobacter gergoviae 

(B) STRAIN: ATCC 33028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 



CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100

TTCCTGAACA AGTGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150

AGAGATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 200

ACACCCCGAT CGTTCGCGGT TCTGCGCTGA AAGCGCTGGA AGGCGACGCA 250

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCCACCTGG ATACCTAYAT 300

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350

ACGTATTCTC CATTTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400

CGCGGTATCA TCAAGGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450

CACCGCGAAA ACCACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500

ACGAAGGCCG TGCTGGTGAG AACGTCGGCG TTCTGCTGCG TGGTATCAAG 550

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCTCCATCAA 600

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATCCTGTCC AAAGACGAAG 650

GCGGCCGTCA CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750

GGTAATGCCG GGCGACAACA TCAAGATGGT TGTTACCCTG ATCCACCCGA 800

TCGCGATGGA CGACGGTCTG CGTTTC 826



2) INFORMATION FOR SEQ ID NO: 56

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter hormaechei 
(B) STRAIN: ATCC 49162

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCTTAC ATCATCGTGT 100

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAGATGGAAG TTCGTGAACT GCTGTCTCAG TACGATTTCC CAGGCGACGA 200 

CACCCCAATC GTTCGTGGTT CCGCGCTGAA AGCGCTGGAA GGCGAMGCAG 250 

AGTGGGAAGM GAAAATCATC GARCTGGCTG GCTTCCTGGA TTCTTACATC 300 

CCAGAACCAG AGCGTGCGAT TGACAAGCCA TTCCTGCTGC CAATCGAAGA 350

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTWGAGC 400 

GCGGTATCAT CAAAGTAGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTGCGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTTCTGG CGAAGCCAGG CTCAATCAAG 600

CCACACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 750 

GTAATGCCAG GCGACAACAT CAAGATGGTT GTGACGCTGA TCCACCCAAT 800 

CGCGATGGAC GACGGTCTGC GTTTCGCAA 829



2) INFORMATION FOR SEQ ID NO: 57

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Enterobacter sakazakii

(B) STRAIN: ATCC 29544 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 

GGCGCTATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAGATGGAAG TGCGCGAGCT GCTGTCTCAG TACGACTTCC CGGGCGACGA 200 

CACCCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCTG 250 

AGTGGGAAGC GAAAATCATC GAGCTGGCAG GTCACCTGGA TTCCTACATC 300

CCGGAACCGG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCC ATCTCYGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAGGTTGGT GAAGAAGTTG AAATCGTGGG CATCAAAGAC 450 

ACCGCGAAAT CCACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCGGGCGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCCATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 650 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACRACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTTGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800

CGCGATGGAC GACGGTCTGC GTTTCGCAAT C 831 



2) INFORMATION FOR SEQ ID NO: 58

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 



CGGCGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCTATG CCTCAAACAC 50 

GTGAACACAT CTTGTTATCA CGTAACGTTG GTGTACCATA CATCGTTGTT 100 

TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTGTCAGA ATATGACTTC CCAGGCGACG 200 

ATGTTCCTGT AATCGCTGGT TCTGCTTTGA AAGCTCYTGA AGGCGATGCT 250

TCATACGAAG AAAAAATCAT GGAATTAATG GCTGCAGTTG ACGAATACGT 300

TCCAACTCCA GAACGTGACA CTGACAAACC ATTCATGATG CCAGTCGAAG 350

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAATCGTTG GTATTGCTGA 450

AGAAACTGCT AAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTGT 500

TAGACTATGC TGAAGCAGGG GATAACATTG GTGCATTGCT ACGTGGTGTT 550

GCTCGTGAAG ACATCCAACG TGGACAAGTA TTGGCTAAAG CTGGTACAAT 600

CACACCTCAT ACAAAATTTA AAGCTGAAGT TTACGTTTTA ACAAAAGAAG 650

AAGGTGGACG TCACACACCA TTCTTCACTA ACTACCGTCC TCAGTTCTAC 700

TTCCGTACAA CTGACGTAAC TGGTGTTGTT GAATTACCAG AAGGAACTGA 750

AATGGTTATG CCTGGTGATA ACGTAACAAT CGACGTTGAA TTGATCCACC 800

CAATCGCTAT CGAAGACGGA ACTCGTTTCT CAATT 835


2) INFORMATION FOR SEQ ID NO: 59 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 826 bases
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus cecorum 

(B) STRAIN: ATCC 43198 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59



GGTGCTATCT TAGTAGTATC TGCTGCTGAT GGTCCTATGC CACAAACTCG 50 

TGAACACATT CTTTTATCAC GTAACGTTGG TGTTCCATAC ATCGTTGTTT 100 

TCTTAAACAA AGTTGATATG GTTGACGACG AAGAATTATT AGAATTAGTT 150 

GAAATGGAAG TACGTGACTT ATTAACTGAA TACGACTTCC CAGGAGACGA 200

TGTTCCTGTA ATCGCTGGTT CTGCATTAAA AGCTTTAGAA GGCGACCCAT 250 

CTTACGAAGA AAAAATCTTA GAATTAATGG CTGCAGTTGA CGAATACATC 300 

CCAACTCCAG AACGTGACAA CGATAAACCA TTCATGATGC CAGTCGAAGA 350 

CGTATTTTCA ATCACTGGTC GTGGTACTGT TGCTACAGGT CGTGTTGAAC 400

GTGGACAAGT ACGTGTTGGT GACGAAGTTG AAATAGTTGG TATCCATGAT 450

GAAATTTCTA AAACAACAGT TACTGGTGTT GAAATGTTCC GTAAATTATT 500 

AGATTACGCT GAAGCTGGAG ACAACATCGG TGCATTATTA CGTGGTGTGG 550

CTCGTGAAGA TATCCAACGT GGTCAAGTAT TAGCTAAACC AGGTTCAATC 600 

ACTCCACATA CAAAATTCAC TGCTGAAGTG TACGTTTTAA CTAAAGAAGA 650 

AGGTGGACGT CATACTCCAT TCTTCACTAA CTACCGTCCA CAATTCTACT 700

TCCGTACAAC TGACGTTACA GGTGTAGTTA ACTTACCAGA AGGTACTGAA 750 

ATGGTTATGC CTGGTGATAA CGTAACTATG GAAGTTGAAT TAATCCACCC 800 

AATCGCTATC GAAGACGGAA CTCGTT 826 



2) INFORMATION FOR SEQ ID NO: 60 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 835 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus dispar 

(B) STRAIN: ATCC 51266 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 



CGGCGCGATC TTGGTAGTAT CTGCTGCTGA TGGTCCTATG CCTCAAACTC 50 

GTGAACACAT CCTATTGTCA CGTAACGTTG GTGTTCCTTA CATCGTCGTT 100 

TTCTTGAACA AAATGGACAT GGTTGATGAC GAAGAATTAT TAGAATTAGT 150

TGAAATGGAA GTTCGTGACT TATTGTCAGA ATACGACTTC CCAGGCGACG 200 

ACACTCCAGT TATCGCAGGT TCAGCTTTGA AAGCCTTAGA AGGCGACGCT 250 

TCATATGAAG AAAAAATCTT AGAATTAATG GCTGCAGTTG ACGAATATAT 300 

CCCAACTCCA GTTCGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG 350 

ATGTATTCTC AATCACTGGT CGTGGTACTG TTGCAACTGG TCGTGTTGAA 400

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAATCGTAG GTATCGCTGA 450 

AGAAACTGCT AAAACTACTG TAACAGGTGT TGAAATGTTC CGTAAATTGT 500 

TGGATTACGC TGAAGCTGGC GACAACATTG GTGCATTATT ACGTGGTGTG 550 

GCTCGTGAAG ATATCCAACG TGGTCAAGTA TTATCAAAAC CAGGTTCAAT 600 

CACTCCACAT ACAAAATTTG CGGCAGAAGT TTACGTTTTA ACTAAAGAAG 650

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGCCC ACAATTCTAC 700 

TTCCGTACAA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGATA ACGTTACTAT GGACGTTGAA TTAATCCACC 800 

CAATCGCGAT CGAAGACGGT ACTCGTTTCT CAATC 835 

2) INFORMATION FOR SEQ ID NO: 61

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus durans 
(B) STRAIN: ATCC 19432

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61

CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGCCCTATG CCTCAAACTC 50 

GTGAACATAT CCTATTATCT CGTCAAGTTG GTGTTCCTTA CATCGTYGTA 100

TTCTTGAACA AAGTAGATAT GGTCGATGAC GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAACAGA ATACGAATTC CCTGGTGACG 200 

ATGTTCCTGT AATCGCTGGT TCAGCTTTGA AAGCTTTAGA AGGCGACGCT 250 

TCATACGAAG AAAAAATCCT TGAATTAATG GCTGCAGTTG ACGAATATAT 300 

CCCAACTCCA GAACGTGACA ACGACAAACC ATTCATGATG CCAGTTGAAG 350

ATGTATTCTC RATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA 400 

CGTGGACAAG TTCGCGTTGG TGACGTTGTA GATATCGTTG GTATCGCAGA 450 

AGAAACAGCT CAAACAACAG TTACTGGTGT TGAAATGTTC CGTAAATTAT 500

TAGRCTACGC TGAAGCTGGA GACAACATTG GTGCTTTACT ACGTGGTGTT 550 

GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGCTAAAC CAGGTACAAT 600

CACKCCTCAT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC 700 

TTCCGTACAA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAACAAT GGAAGTTGAA TTAATCCACC 800 

CAATCGCTAT CGAAAATGGT ACTAAATTCT CAATC 835


2) INFORMATION FOR SEQ ID NO: 62 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 680 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R610 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 

AGTAGTTTCT GCTGCTGATG GTCCTATGCC TCAAACACGT GAACATATCT 50 

TATTATCACG TAACGTTGGT GTACCATACA TCGTTGTATT CTTAAACAAA 100 

ATGGATATGG TTGATGACGA AGAATTATTA GAATTAGTAG AAATGGAAGT 150 

TCGTGACTTA TTATCAGAAT ACGATTTCCC AGGCGATGAT GTTCCAGTTA 200 

TCGCAGGTTC TGCTTTGAAA GCTTTAGAAG GCGACGAGTC TTATGAAGAA 250 

AAAATCTTAG AATTAATGGC TGCAGTTGAC GAATATATCC CAACTCCAGA 300 

ACGTGATACT GACAAACCAT TCATGATGCC AGTCGAAGAC GTATTCTCAA 350 

TCACTGGACG TGGTACTGTT GCTACAGGCC GTGTTGAACG TGGTGAAGTT 400

CGCGTTGGTG ACGAAGTTGA AATCGTTGGT ATTAAAGACG AAACATCTAA 450 

AACAACTGTT ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG 500 

AAGCAGGCGA CAACATCGGT GCTTTATTAC GTGGTGTAGC ACGTGAAGAT 550 

ATCGAACGTG GACAAGTATT AGCTAAACCA GCTACAATCA CTCCACACAC 600 

AAAATTCAAA GCTGAAGTAT ACGTATTATC AAAAGAAGAA GGCGGACGTC 650 

ACACTCCATT CTTCACTAAC TACCGTCCTC 680 



2) INFORMATION FOR SEQ ID NO: 63

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 680 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R487 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 



AGTAGTTTCT GCTGCTGATG GTCCTATGCC TCAAACACGT GAACATATCT 50

TATTATCACG TAACGTTGGT GTACCATACA TCGTTGTATT CTTAAACAAA 100

ATGGATATGG TTGATGACGA AGAATTATTA GAATTAGTAG AAATGGAAGT 150

TCGTGACTTA TTATCAGAAT ACGATTTCCC AGGCGATGAT GTTCCAGTTA 200

TCGCAGGTTC TGCTTTGAAA GCTTTAGAAG GCGACGAGTC TTATGAAGAA 250

AAAATCTTAG AATTAATGGC TGCAGTTGAC GAATATATCC CAACTCCAGA 300

ACGTGATACT GACAAACCAT TCATGATGCC AGTCGAAGAC GTATTCTCAA 350

TCACTGGACG TGGTACTGTT GCTACAGGCC GTGTTGAACG TGGTGAAGTT 400

CGCGTTGGTG ACGAAGTTGA AATCGTTGGT ATTAAAGACG AAACATCTAA 450

AACAACTGTT ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG 500

AAGCAGGCGA CAACATCGGT GCTTTATTAC GTGGTGTAGC ACGTGAAGAT 550

ATCGAACGTG GACAAGTATT AGCTAAACCA GCTACAATCA CTCCACACAC 600

AAAATTCAAA GCTGAAGTAT ACGTATTATC AAAAGAAGAA GGCGGACGTC 650

ACACTCCATT CTTCACTAAC TACCGTCCTC 680


2) INFORMATION FOR SEQ ID NO: 64

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 685 bases

(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R482 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 



AGTAGTTTCT GCTGCTGACG GCCCAATGCC TCAAACTCGT GAACACATCC 50 

TATTGTCTCG TCAAGTTGGT GTTCCTTACA TCGTTGTATT CTTGAACAAA 100 

GTAGACATGG TTGATGACGA AGAATTACTA GAATTAGTTG AAATGGAAGT 150 

TCGTGACCTA TTAACAGAAT ACGAATTCCC TGGTGACGAT GTTCCTGTAG 200 

TTGCTGGATC AGCTTTGAAA GCTCTAGAAG GCGACGCTTC ATACGAAGAA 250

AAAATTCTTG AATTAATGGC TGCAGTTGAC GAATACATCC CAACTCCAGA 300

ACGTGACAAC GACAAACCAT TCATGATGCC AGTTGAAGAC GTGTTCTCAA 350 

TTACTGGACG TGGTACTGTT GCTACAGGTC GTGTTGAACG TGGACAAGTT 400 

CGCGTTGGTG ACGAAGTTGA AGTTGTTGGT ATTGCTGAAG AAACTTCAAA 450 

AACAACAGTT ACTGGTGTTG AAATGTTCCG TAAATTGTTA GACTACGCTG 500

AAGCTGGAGA CAACATTGGT GCTTTACTAC GTGGTGTTGC ACGTGAAGAC 550 

ATCCAACGTG GACAAGTTTT AGCTAAACCA GGTACAATCA CACCTCATAC 600 

AAAATTCTCT GCAGAAGTAT ACGTGTTGAC AAAAGAAGAA GGTGGACGTC 650 

ATACTCCATT CTTCACTAAC TACCGTCCTC AATTT 685 


2) INFORMATION FOR SEQ ID NO: 65 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
(B) STRAIN: ATCC 49996

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 

CGGCGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCTAYG CCTCAAACAC 50 

GTGAACACAT CTTGTTATCA CGTAACGTTG GTGTACCATA CATCGTTGTT 100

TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAC TAGAATTAGT 150

TGAAATGGAA GTTCGTGACT TATTGTCAGA ATATGACTTC CCAGGCGACG 200

ATGTTCCTGT AATCGCTGGT TCTGCTTTGA AAGCTCTTGA AGGCGATGCT 250

TCATACGAAG AAAAAATCAT GGAATTAATG GCTGCAGTTG ACGAATACGT 300

TCCAACTCCA GAACGTGACA CTGACAAACC ATTCATGATG CCAGTCGAAG 350

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAATCGTTG GTATTGCTGA 450

AGAAACTGCT AAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTGT 500

TAGACTATGC TGAAGCAGGG GATAACATTG GTGCATTGCT ACGTGGGGTT 550

GCTCGTGAAG ACATCCAACG TGGACAAGTA TTAGCTAAAG CTGGTACAAT 600

CACACCTCAT ACAAAATTTA AAGCTGAAGT TTACGTTTTA ACAAAAGAAG 650

AAGGTGGACG TCACACTCCA TTCTTCACTA ACTACCGTCC TCAGTTCTAC 700

TTCCGTACAA CTGACGTAAC TGGTGTTGTT GAATTACCAG AAGGAACTGA 750

AATGGTTATG CCTGGTGATA AMGTAACAAT CGACGTTGAA TTGATCCACC 800

CAATCGCTAT CGAAGACGGA ACTCG 825



2) INFORMATION FOR SEQ ID NO: 66

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 636 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Enterococcus gallinarum

(B) STRAIN: R420 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 

TCCTATGCCT CAAACTCGTG AACACATCTT GTTATCACGT AACGTTGGCG 50

TACCATACAT CGTTGTTTTC TTGAACAAAA TGGATATGGT TGATGACGAA 100 

GAATTGCTAG AATTAGTTGA AATGGAAGTT CGTGACCTAT TGTCTGAGTA 150 

TGACTTCCCA GGCGACGATG TTCCTGTAAT CGCCGGTTCT GCTTTGAAAG 200 

CTCTTGAAGG AGATCCTTCA TACGAAGAAA AAATCATGGA ATTGATGGCT 250 

GCAGTTGACG AATACGTTCC AACTCCAGAA CGTGATACTG ACAAACCATT 300

CATGATGCCA GTCGAAGACG TATTCTCAAT CACTGGACGT GGTACTGTTG 350 

CTACAGGCCG TGTTGAACGT GGACAAGTTC GCGTTGGTGA TGAAGTAGAA 400 

ATCGTTGGTA TTGCTGACGA AACTGCTAAA ACAACTGTAA CAGGTGTTGA 450 

AATGTTCCGT AAATTGTTAG ACTATGCTGA AGCAGGGGAT AACATTGGTG 500 

CATTGCTACG TGGGGTTGCT CGTGAAGACA TCCAACGTGG ACAAGTATTG 550

GCTAAAGCTG GTACAATCAC ACCTCATACA AAATTCAAAG CTGAAGTTTA 600

TGTTTTGACA AAAGAAGAAG GTGGACGTCA CACTCC 636



2) INFORMATION FOR SEQ ID NO: 67 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 835 bases
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus hirae 

(B) STRAIN: ATCC 8043 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67



CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGTCCTATG CCTCAAACTC 50 

GTGAACATAT CCTAYTATCT CGTCAAGTTG GTGTTCCATA CATCGTTGTA 100 

TTCTTGAACA AAGTAGATAT GGTTGACGAC GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAACAGA ATACGAATTC CCTGGTGACG 200

ATGTTCCTGT AGTTGCTGGT YCAGCTTTGA AAGCTTTAGA AGGCGACGCT 250 

TCATACGAAG AAAAAATCCT TGAATTGATG GCTGCAGTTG ACGAATATAT 300 

CCCAACTCCA GAACGTGACA ACGACAAACC ATTCATGATG CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA 400

CGTGGACAAG TTCGCGTTGG TGACGTTGTA GATATCGTTG GTATCGCAGA 450

AGAAACAGCT CAAACAACAG TTACTGGTGT TGAAATGTTC CGTAAATTAT 500

TAGACTACGC TGAAGCTGGA GACAACATTG GTGCTTTACT ACGTGGTGTT 550

GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGCTAAAC CAGGTACAAT 600 

CACACCTCAT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACAAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC 700

TTCCGTACRA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAACAAT GGAAGTTGAA TTAATCCACC 800 

CAATCGCTAT CGAAAACGGT ACTAAATTCT CAATC 835


2) INFORMATION FOR SEQ ID NO: 68 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 835 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus mundtii 

(B) STRAIN: ATCC 43186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 



CGGAGCAATC TTAGTTGTTT CTGCTGCTGA CGGCCCTATG CCTCAAACTC 50

GTGAACACAT CCTATTATCT CGTCAAGTTG GTGTACCATA CATCGTTGTA 100

TTCTTGAACA AAGTAGATAT GGTTGATGAC GAAGAATTAC TTGAATTAGT 150

TGAAATGGAA GTTCGTGACC TATTAACAGA ATACGAATTC CCTGGTGACG 200

ATGTTCCTGT AATCGCTGGT TCAGCTTTAA GAGCTTTAGA AGGCGACGCT 250

KCATACGAAG AAAAAATTCT TGAATTGATG GCTGCAGTTG ACGAATATAT 300

CCCAACTCCA GAACGTGATA ACGACAAACC ATTCATGATG CCAGTTGAGG 350

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG ACGTGTTGAA 400

CGTGGACAAG YTCGTGTTGG TGACGTTATC GATATCGTTG GTATCGCAGA 450

AGAAACAGCT CAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTAT 500

TAGACTACGC TGAAGCAGGC GATAACATTG GTGCGTTACT ACGTGGTGTT 550

TCACGTGAAG ACATCCAACG TGGTCAAGTT TTAGCTAAAC CAGGTACAAT 600

CACACCTCAT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACTAAAGAAG 650

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC 700

TTCYGTACGA CTGACGTAAC TRGTGTTGTY GAATTACCAG AAGGAACTGA 750

AATGGTTATG CCTGGCGACA ACGTAACAAT GGAAGTTGAA TTAATCCACC 800

CAATCGCTAT CGAAAATGGT ACTAAATTCT CAATC 835



2) INFORMATION FOR SEQ ID NO: 69

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 836 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus pseudoavium 
(B) STRAIN: ATCC 49372

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCTATG CCTCAAACAC 50 

GTGAACACAT CTTGTTATCT CGTAACGTTG GTGTTCCTTA CATCGYTGTA 100

TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTGTCAGA ATACGATTTC CCAGGCGACG 200 

ACACTCYAGT TATCGCTGGT TCAGCYTTGA AAGCTTTAGA AGGCGACCCT 250 

TCATACRAAG AAAAAATCTT AGAATTAATG SCTGCTGTTG ACGAATACAT 300 

CCCAACACCA GTTCGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG 350

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCAACTGG TCGTGTTGAA 400

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAATCGTAG GTATCGCTGA 450 

AGAAACTGCT AAAACAACTG TTACAGGTGT TGAAATGTTC CGTAAATTGT 500 

TAGACTACGC TGAAGCAGGC GATAACATCG GTGCATTATT ACGTGGTGTT 550 

GCACGTGAAG ACATCCAACG TGGACAAGTA TTGGCTAAAC CAGCTTCAAT 600

CACTCCACAT ACAAAATTCT CTGCAGAAGT TTACGTTTTA ACTAAAGAAG 650 

AAGGCGGGCG TCACACTCCG TTCTTCACTA ACTACCGTCC TCAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTGTTGTT GATCTACCAG AAGGTACTGA 750 

AATGGTAATG CCTGGTGATA ACGTAACTAT GGAAGTTGAA TTAATCCACC 800 

CAATCGCGAT CGAAGACGGA ACTCGTTTCT CTATTC 836



2) INFORMATION FOR SEQ ID NO: 70 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Enterococcus raffinosus

(B) STRAIN: ATCC 49427

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 

CGGAGCTRTC TTAGTAGTAT CTGCTGCTGA TGGCCCTATG CCTCAAACTC 50

GTGAACACAT CTTGTTATCT CGTAACGTTG GTGTTCCTTA CATCGTTGTA 100 

TTCTTAAACA AAATGGATAT GGTTGACGAT GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAACTGA ATACGACTTC CCAGGCGACG 200 

ACACTCCAGT TATCGCAGGT TCAGCTTTGA AAGCCTTAGA AGGCGACGCT 250 

TCATACGAAG AAAAAATCTT AGAATTAATG GCTGCTGTTG ATGAATACAT 300

CCCAACACCA GTTCGTGATA CTGACAAACC ATTCATGATG CCAGYGGAAG 350

ACGTAYTCTC AATCACTGGT CGTGGAACTG TTGCAACTGG TCGTGTTGAA 400

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAATCGTAG GTATTGCTGA 450

AGAAACTGCT AAAACAACTG TTACAGGTGT TGAAATGTTC CGTAAATTGT 500

TGGATTACGC TGAAGCGGGC GACAACATTG GTGCATTATT ACGTGGTGTT 550

GCACGTGAAG ACATCCAACG TGGACAAGTA TTGGCTAAAC CAGCTTCAAT 600

CACTCCACAT ACAAAATTCT CTGCAGAAGT TTACGTTTTA ACTAAAGAAG 650

AAGGCGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC TCAGTTCTAC 700

TTCCGTACAA CTGACGTAAC TGGTGTAGTT GATCTACCAG AAGGTACTGA 750

AATGGTAATG CCTGGTGATA ACGTAACTAT GGAAGTTGAA TTAATCCACC 800

CAATCGCGAT CGAAGACGGA ACTCGTTTCT CTATT 835



2) INFORMATION FOR SEQ ID NO: 71



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus saccharolyticus 

(B) STRAIN: ATCC 43076 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 


CGGCGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCTATG CCTCAAACTC 50 

GTGAACACAT CTTGTTATCT CGTAACGTAG GTGTTCCTTA CATCGTTGTA 100

TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTATCAGA ATACGATTTC CCAGGCGATG 200 

ACACTCCAGT TATTGCAGGT TCTGCTTTGA AAGCTTTAGA AGGCGATCCA 250

GTTTACGAAG AAAAAATCTT CGAATTAATG GCTGCAGTTG ACGAATATAT 300

CCCAACTCCA GAACGTGATA CTGAAAAACC ATTCATGATG CCAGTTGAGG 350

ATGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA 400

CGTGGACAAG TTCGCGTTGG TGACGTTGTA GAAATCGTTG GTATCGACGA 450

AGAAACAGCT CAAACTACTG TAACAGGTGT TGAAATGTTC CGTAAATTAT 500

TAGACTACGC TGAAGCAGGC GATAACATCG GTGCTTTATT ACGTGGGGTT 550 

GCTCGTGAAG ACATCCAACG TGGACAAGTA TTAGCTAAAC CAGGAACAAT 600 

CACTCCTCAT ACAAAATTCG TAGCTGAAGT TTACGTTTTA ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC TCAATTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTGTTGTA GAATTACGCG AAGGTACTGA 750

AATGGTAATG CCTGGTGACA ACGTAACTAT CGACGTTGAA TTAATCCACC 800 

CAATCGCTAT CGAAGACGGA ACTCGTTTCT CTATT 835 




2) INFORMATION FOR SEQ ID NO: 72 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 823 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus solitarius 

(B) STRAIN: ATCC 49428

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72

GAGCTATCTT GGTAGTTTCT GCAGCTGATG GCCCAATGCC ACAAACTCGT 50 

GAACATATTT TGTTGTCACG TAATGTAGGT GTACCTTACA TCGTTGTGTT 100 

CTTGAACAAA ATGGATATGG TTGATGACGA AGAATTACTT GAGTTAGTTG 150 

AAATGGAAGT ACGTGATCTA TTATCTGAAT ACGACTTCCC AGGAGATGAT 200

ACTCCAGTTA TTTCCGGTTC AGCTTTGAAA GCTTTAGAAG GCGACGAAGA 250 

ATATGAACAA AAAATTATGG ACTTAATGGA TGCAGTTGAT GACTACATTC 300 

CAACTCCTGA ACGTGACCAT GACAAACCAT TCATGATGCC AATTGAAGAT 350 

GTATTTTCAA TTACAGGCCG TGGTACTGTT GCTACAGGAC GTGTTGAACG 400

CGGGACTATC AAAGTCGGCG ATGAAGTTGA CATTATTGGT ATTCATGAAG 450

ACGTTAAAAA GACAACAGTT ACTGGTGTAG AAATGTTCCG TAAATTGTTG 500 

GACTACGCTG AAGCAGGCGA TAACATTGGT ACTTTGTTAC GTGGTGTTTC 550 

TCGTGATGAT ATCGAACGTG GTCAAGTATT AGCTAAACCA GGTTCAATCA 600 

CACCACATAC AAGATTCTCT GCTGAAGTTT ATGTTTTGAC TAAAGAAGAA 650 

GGCGGACGTC ATACTCCATT CTTCTCAAAC TATCGTCCTC AATTCTACTT 700

CCGTACAACT GATATCACTG GTGTCATTGA ATTGCCAGAA GGTACTGAAA 750 

TGGTAATGCC AGGTGATAAT GTAACAATGG ATGTTGAATT AATCCACCCA 800 

GTCGCTATCG AAGAAGGAAC TCG 823 






2) INFORMATION FOR SEQ ID NO: 73 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 835 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 



CGGTGCAATC TTGGTCGTAT CAGCGACAGA TGGCCCAATG CCACAAACAC 50 

GGGAGCATAT TTTGCTTTCT CGTCAAGTGG GTGTGAAACA TTTGATCGTC 100 

TTTTTGAATA AGACGGACCT TGTCGATGAT GACGAGTTGA TCGATTTAGT 150

TGAAATGGAA GTCAGAGAAT TGCTGACTGA ATATGATTTT CCTGGCGACG 200 

ACATTCCTGT GATCAAGGGC TCTGCGTTAA AAGCCTTGGA AGGGGACCCA 250 

GATGCTGAAG CAGCGATCTT AACGCTGATG GATACAGTAG ATGAATATAT 300 

CCCAACGCCA GAACGTGATA CTGACAAACC ATTGTTGTTA CCGATCGAAG 350 

ATGTCTTTTC GATCACAGGA CGGGGGACCG TTGCTTCTGG TCGGATCGAT 400

CGCGGCATGG TAAAAGTCGG GGATGAAGTA GAAATCGTCG GAATCAAACC 450 

TGAAACACAA AAAGCAGTCG TGACAGGGGT AGAAATGTTC CGCAAAACGA 500 

TGGACTTCGG AGAAGCTGGC GATAACGTAG GGGTATTGTT ACGGGGCATC 550 

ACCCGTGATG AAATTGAACG TGGCCAAGTG TTAGCAAAAC CAGGTTCTAT 600 

CACACCGCAT ACGAAATTCC AAGCGGAAGT CTATGTGTTG ACAAAAGAAG 650

AAGGCGGTCG CCATACCCCA TTCTTTAATA ATTATCGCCC ACAATTTTAC 700 

TTCCGTACAA CGGACGTAAC TGGGAATATC GTTTTACCAG AAGGAACGGA 750 

AATGGTGATG CCTGGTGACA ACGTAACGAT CGATGTGGAA TTGATCCATC 800 

CGATCGCTGT AGAAAATGGA ACGACCTTCT CGATT 835



2) INFORMATION FOR SEQ ID NO: 74

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 380 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE:

(A) ORGANISM: Staphylococcus saprophyticus
(B) STRAIN: ATCC 15305



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74

TAACGGGCGT CTCGATAGAA AAACACGTGA AAATCCCAAT GATTATAAAC 50

AATCAATATA CGATTTTGCT GAAGCTGTAA CAAAAGGTAT TAAGGAACAA 100

ACAAATAAAA ATTAATAGGC AACTTAACCA GAATCGTTAA AACTATATGA 150

AGATTCTGGT TTTTTAAATT CAAAAAGTTT TCTAAAAAAT TTACTTGCTT 200

TTTTAAGTAT AGGTATAAAA TACGATTGAT TAAAACAGTA AAGGAAATGA 250

ATCATGAAAC AATTAACTAA GCCTTTATAC TTTTACCTAT TACTTTTTAT 300

TACAACAACG CTGATTGGCG CGTTACTATT ATATTTGCCA ATCACAGGTA 350

AACATCCTAT TGATTTTGTG GACGCCCGTT 380



2) INFORMATION FOR SEQ ID NO: 75

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 666 bases

(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: ATCC 49996

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 


GCAATCTTGG TCGTATCAGC GACAGATGGC CCAATGCCAC AAACACGGGA 50 

GCATATTTTG CTTTCTCGTC AAGTGGGTGT GAAACATTTG ATCGTCTTTT 100 

TGAATAAGAC GGACCTTGTC GATGATGACG AGTTGATCGA TTTAGTTGAA 150

ATGGAAGTCA GAGAATTGCT GACTGAATAT GATTTTCCTG GCGACGACAT 200 

TCCTGTGATC AAGGGCTCTG CGTTAAAAGC CTTGGAAGGG GACCCAGATG 250

CTGAAGCAGC GATCTTAACG CTGATGGATA CGGTAGATGA ATATATCCCA 300 

ACGCCAGAAC GTGATACTGA CAAACCATTG TTGTTACCGA TCGAAGATGT 350 

CTTTTCGATC ACAGGACGGG GGACCGTTGC TTCTGGTCGG ATCGATCGCG 400 

GCATGGTAAA AGTCGGGGAT GAAGTAGAAA TCGTCGGAAT CAAACCTGAA 450 

ACACAAAAAG CAGTCGTGAC AGGGGTAGAA ATGTTCCGCA AAACGATGGA 500

CTTCGGAGAA GCTGGCGATA ACGTAGGGGT ATTGTTACGG GGCATCACCC 550 

GTGATGAAAT TGAACGTGGC CAAGTGTTAG CAAAACCAGG TTCTATCACA 600 

CCGCATACGA AATTCCAAGC GGAAGTCTAT GTGTTGACAA AAGAAGAAGG 650 

CGGTCGCCAT ACCCCA 666



2) INFORMATION FOR SEQ ID NO: 76

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 
(B) STRAIN: ATCC 49573

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 

TGGTGCGATT TTAGTTGTAT CCGCAACAGA TGGTCCAATG CCTCAAACCC 50 

GGGAACATAT CTTGCTTTCG AGACAAGTTG GTGTGAAACA TCTGATTGTT 100

TTCTTGAACA AAATCGATTT AGTCGATGAC GAAGAATTGA TTGATTTAGT 150 

AGAAATGGAA GTAAGAGAAC TGCTATCTGA ATATAATTTT CCAGGGGATG 200

ACATTCCTGT TATCAAAGGT TCGGCATTAA AAGCGTTGGA AGGAGACCCT 250 

GATGCAGAAG CTGCCATCAT GGAATTAATG GATACAGTAG ACAGCTATAT 300

CCCAACACCT GAGCGTGATA CAGACAAACC ATTACTCTTG CCAGTTGAAG 350

ATGTCTTTTC GATTACTGGA CGAGGAACAG TTGCTTCCGG ACGGATCGAT 400

CGGGGAACAG TTCGGGTAGG CGATGAAGTA GAAATCGTCG GTATCAAACC 450 

TGAAACCCAA AAAGCTGTAG TGACAGGCGT CGAAATGTTC CGCAAGACGA 500 

TGGACTTTGG GGAAGCCGGT GACAATGTAG GTGTCTTGCT GAGAGGGATC 550 

ACTCGTGACG AAATTGAACG AGGACAAGTG TTGGCTAAAC CAGGTTCGAT 600

CACACCACAT ACAAAATTCC AAGCAGAAGT TTATGTATTG ACGAAAGAAG 650 

AAGGTGGTCG TCATACACCA TTCTTCAACA ACTATCGTCC ACAATTTTAT 700 

TTCCGTACAA CGGATGTGAC AGGGAACATT ACATTGCCTG AAGGAACAGA 750 

A 751 




2) INFORMATION FOR SEQ ID NO: 77 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ehrlichia canis 
(B) STRAIN: Florida

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 

TGCAGCAATA TTAGTAGTGT CTGCAACTGA TGGAGCAATG CCACAAACAA 50 

GAGAACATAT ATTATTAGCA AAGCAAGTAG GTGTAAAAGA TATAGTAGTG 100

TGGATGAATA AGTGTGATGT TGTAGATGAT GAAGAAATGT TGTCATTAGT 150 

TGAAATGGAA ATAAGGGAAT TGTTATCAAA ATATGGGTAT CCTGGGGATG 200

ATATAGATGT AGTTAGAGGA TCTGCAGTTA AAGCATTAGA AGAAGAAACA 250 

GGCTCAGGTG TGTGGAGTGA AAAAATAATG GAATTGATGA ATGCTTTAGA 300

AAAAATAAGT TTACCAGTAA GAGAAAAAGA TAAGCCATTT TTAATGTCAA 350

TAGAAGATGT GTTTTCAATA CCTGGAAGAG GTACAGTAGT AACAGGAAGA 400

ATAGAAAGAG GAGTAATTAG AGTAGGGGAT AAAATAGAGA TAGTAGGATT 450

GCGTGAGATA CAAAGTACAG TATGTACAGG TGTTGAAATG TTTCATAAAG 500

CATTAGATGC AGGAGAAGCA GGGGATAATG CTGGAATATT GTTAAGAGGG 550

ATAAAAAAAG AAGATGTAGA AAGAGGGCAA GTATTGAGTG CACCTGGACA 600

GATACATTCA TATAAGAGAT TTAAGGCAGA GGTATATATA TTGAAAAAAG 650

AAGAAGGAGG AAGACATACT CCATTTTTCT CAAATTACCA GCCGCAATTT 700

TATGTTAGAA CAACAGATGT AACAGGGAAT ATAAAGTTAC CAGAAGGAGT 750

AGAAATGGTA ATGCCAGGGG ATAATATAAA TATCGAAGTG AGTTTGGATA 800

AGCCTGTTGC TATTGATCAA GGATTGAGAT TTGC 834



2) INFORMATION FOR SEQ ID NO: 78 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Escherichia coli

(B) STRAIN: ATCC 23511 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 

CGGCGCGATC CTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTAYAT 300

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800

TCGCGATGGA CGACGGT 817 



2) INFORMATION FOR SEQ ID NO: 79

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Escherichia fergusonii

(B) STRAIN: ATCC 35469



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 



CGATCCTGGT AGTTGCTGCG ACTGACGGCC CGATGCCGCA GACTCGTGAG 50

CACATCCTGC TGGGTCGTCA GGTAGGCGTT CCGTACATCA TCGTGTTCCT 100

GAACAAGTGC GACATGGTTG ATGACGAAGA GCTGCTGGAA CTGGTTGAAA 150

TGGAAGTTCG TGAACTTCTG TCTCAGTACG ACTTCCCGGG CGACGACACT 200

CCGATCGTTC GTGGTTCTGC TCTGAAAGCG CTGGAAGGCG ACGCAGAGTG 250

GGAAGCGAAA ATCCTGGAAC TGGCTGGCTT CCTGGATTCT TACATTCCGG 300

AACCAGAGCG TGCGATTGAC AAGCCGTTCC TGCTGCCGAT CGAAGACGTG 350

TTCTCCATCT CCGGTCGTGG TACCGTTGTT ACCGGTCGTG TAGAACGCGG 400

TATCATCAAA GTTGGTGAAG AAGTTGAAAT CGTTGGTATC AAAGAGACTC 450

AGAAGTCTAC CTGTACTGGC GTTGAAATGT TCCGCAAACT GCTGGACGAA 500

GGCCGTGCTG GTGAGAACGT AGGTGTTCTG CTGCGTGGTA TCAAACGTGA 550

AGAAATCGAA CGTGGTCAGG TACTGGCTAA GCCGGGCACC ATCAAGCCGC 600

ACACCAAGTT CGAATCTGAA GTGTACATTC TGTCCAAAGA TGAAGGCGGT 650

CGTCATACTC CGTTCTTCAA AGGCTACCGT CCGCAGTTCT ACTTCCGTAC 700

TACTGACGTG ACTGGTACCA TCGAACTGCC GGAAGGCGTA GAGATGGTAA 750

TGCCGGGCGA CAACATCAAA ATGGTTGTTA CCCTGATCCA CCCGATCGCG 800

ATGGACGACG GTCTGCGTTT CGCAA 825






2) INFORMATION FOR SEQ ID NO: 80 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 829 bases
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia hermannii 

(B) STRAIN: ATCC 33650

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAGATGGAAG TTCGCGAACT GCTGTCCCAG TACGATTTCC CGGGCGACGA 200

CACCCCGATC GTTCGTGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCAG 250 

AGTGGGAAGA GAAAATCATC GAACTGGCTG GCTACCTGGA TTCCTATATC 300 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CTATCGAAGA 350 

CGTATTCTCC ATCTCCGGCC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAAGTGGGT GAAGAAGTTG AAATCGTGGG TATCAAAGAT 450

ACTGCGAAAT CAACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCGGGCGAGA ACGTGGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG TTCCATCAAG 600 

CCKCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTTGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGGAC GACGGTCTGC GTTTCGCAA 829



2) INFORMATION FOR SEQ ID NO: 81

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 816 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia vulneris 

(B) STRAIN: ATCC 33821 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 



CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150

TGAGATGGAA GTGCGTGAAC TTCTGTCCCA GTACGACTTC CCGGGCGACG 200 

ACACCCCGAT CATTCGTGGT TCTGCGCTGA AAGCGCTGGA AGGCGAAGCT 250 

GAGTGGGAAG AGAAAATCGT TGAGCTGGCT GGCTACCTGG ATTCCTACAT 300

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400

CGCGGTATCA TCAARGTKGG TGAAGAAGTT GAAATCGTGG GTATCAAAGA 450 

TACTGCGAAA TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGTCG TGCAGGCGAG AACTGCGGCG TTCTGCTGCG TGGTATCAAG 550 

CGTGAAGAGA TCCAGCGTGG CCAGGTTCTG GCTAAGCCGG GCTCAATCAA 600 

GCCGCACACC AAGTTCGAAT CCGAAGTGTA CATCCTGTCC AAAGACGAAG 650

GCGGCCGTCA CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCATCCGA 800 

TCGCGATGGA CGACGG 816 




2) INFORMATION FOR SEQ ID NO: 82 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Eubacterium lentum

(B) STRAIN: ATCC 43055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 

CGGCGCCTCC TCGTTATCGC CGCCACCGAC GGCCCGATGG CCCAGACCCG 50 

CGAGCACATC CTGCTCGCCC GTCAGGTCGG CGTGCCCTAC ATCGTGGTCT 100 

TCCTGAACAA GTGCGACATG GTCGACGACG AGGAGCTCCT CGAGCTCGTC 150 

GAGATGGAAG TTCGCGAGCT GCTCGACTCT TACGAGTTCC CGGGCGACGA 200

CACCCCGATC ATCCGCGGCT CCGCTTTGAA GGCCCTCGAG GGCGACAAAG 250

AGTGGCAGGA GAAGGTCTGG GAGCTCATGG ACGCCGTCGA CTCCTACATC 300 

CCGACGCCGG AGCGCATGGT CGACAAGCCG TTCCTGATGG CCGTCGAGGA 350 

CACGATGACC ATCACCGGCC GCGGCACCGT TGCCACCGGT CGTGTGGAGC 400

GTGGTACGCT GCATGTCAAC GACCCGCTGG AGATCGTCGG TATCAAGGAG 450

ACCCAGAACA CGGTCTGCAC CGGTATCGAG ATGTTCCGCA AGCTGCTCGA 500

CGAGGCTCAG GCCGGCGACA ACATCGGCTG CCTGCTCCGC GGTGTCAAGC 550

GCGAGGAGAT CGTTCGCGGC CAGGTTCTCT GCAAGCCCGG TAGCGTGACC 600

CCGCACACCG AGTTCGAGGG TCAGGTCTAC ATCCTGACGA AGGAAGAGGG 650

CGGCCGCCAC ACGCCGTTCT TCGACGGCTA CCGTCCGCAG TTCTACTTCC 700

GCACGACGGA CGTGACGGGT GTTGCCCACC TTCCCGAGGG CACCGAGATG 750

GTCATGCCGG GCGACAACGT GGAGATCAAG GGCGAGCTCA TTCACCCGAT 800

CGCCAGGAAG AGGGCTGCGT TCGCTAAC 828



2) INFORMATION FOR SEQ ID NO: 83






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Eubacterium nodatum

(B) STRAIN: ATCC 33099 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 

GGAGCAATTC TGGTTTGTGC AGCAACKGAC GGACCAATGC CTCAGACAAG 50

AGAACATATC CTTTTGTCAA GGCAGGTAGG AGTGCCATAT ATCATCGTAT 100 

TCCTGAATAA ATGTGACATG GTGGATGAYG AAGAGCTTCT GGACTTGGTA 150 

GAGATGGAAG TAAGAGAACT TCTCAGTGAG TATGAATTCC CGGGAGATGA 200

TACCCCGATA GTAAGAGGTT CAGCCCTGAA GGCACTGGAA GAACCCAATG 250 

GAGAATGGGC AGACAAGATT GTAGAGCTGA TGGAGGAAGT AGATAAATAC 300

ATTCCTGAAC CAAAGAGAGA TAACGACAAA CCGTTCCTGA TGCCTGTAGA 350 

GGACGTATTC TCAATAACAG GAAGAGGAAC AGTAGCGACA GGAAGRGTTG 400 

AAAGAGGAAT CCTGAAGGTC GGTGATGAAG TAGAAATCGT GGGAATGAGC 450

GAAGAGAGAA GAAAGGTAGT AGTAACGGGA GTTGAAATGT TCAGAAAGCT 500 

TCTGGATGAA GCAGAGACAG GAGACAACAT CGGAGCACTG CTGAGAGGAG 550

TTCAGAGAAC RGAGATCCAG AGAGGTCAGG TATTGGCRGC ACCTGGAACG 600 

ATCAACCCAC ATACAAAgVt~^AAg"ggYcAG GTATaTGTAC TG^GAAGGa" 650

AGAAGGAGGA AGGCATACGC CGTTCTTCAA YGGATACAGW CCACAGTTCT 700 

ACTTCAGAAC AACAGACGTA ACAGGAGATT TGCAGCTGCC GGAAGGARCA 750 

GAGATGTGCA TGCCGGGAGA TAATGTGGTA ATGAACRTCA GCCTGATCAC 800

TCCGATTGCT ATAGAAGAGG GWCTGAGATT TGCCA 835 



2) INFORMATION FOR SEQ ID NO: 84

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases

(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Ewingella americana

(B) STRAIN: ATCC 33852 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 


GCGATCCTGG TTGTTGCTGC AACTGATGGC CCTATGCCAC AGACTCGTGA 50 

GCACATCCTG TTGGGTCGYC AGGTTGGCGT TCCATTCATG ATCGTATTCA 100 

TGAACAAATG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTAGAA 150 

ATGGAAGTTC GYGAACTTCT GTCTGCTTAC GATTTCCCAG GCGACGACAT 200 

CCCAGTTGTT A^AGGTTCAG CGCTGAAAGC ACTGGAAGGC GAAGCAGAGT 250

GGGAAGCTAA GATCATCGAG CTGGCTGGCC ACCTGGATAG CTACATCCCA 300 

GAACCAGAGC GTGCGATTGA CAAGCCATTC CTGCTGCCAA TCGAAGACGT 350 

ATTCTCCATC TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAGCGCG 400 

GTATCATCAA AGTTGGCGAA GAAGTTGAAA TCGTTGGTAT CAAGGACACT 450 

GTTAAGTCTA CTTGTACTGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA 500

AGGCCGTGCY GGTGAGAACG TTGGTGTTCT GCTGCGTGGT ATCAAGCGTG 550 

AAGACATCGA ACGTGGTCAG GTTCTGGCTA AACCAGGTTC AATCAAACCA 600 

CACACCACWT TYGACTCAGA AGTTTATATC CTGAGCAAAG ATGAAGGCGG 650 

CCGTCATACT CCGTTCTTCA AAGGCTACCG TCCACAGTTC TACTTCCGTA 700 

CAACTGACGT GACCGGTACC ATCGAACTGC CAGAAGGCGT AGAGATGGTA 750

ATGCCAGGTG ACAACATCAA CATGRTAGTT ACCCTGATCC ACCCAATCGC 800 

GATGGATGAC GGTCTGCGTT TCGCAA 826 






2) INFORMATION FOR SEQ ID NO: 85

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 828 bases
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Francisella tularensis 

(B) STRAIN: LVS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85

tggtgctatt" ctagt atgYt" ~C tgc tgcgga" tg^tc^tatg" ccaca^cxcT ~ b U 

GTGAGCACAT TCTGCTTTCT CGTCAAGTTG GTGTACCAAA AATCGTTGTT 100 

TTCTTAAACA AGTGTGACAT GGTTGATGAT GAAGAGTTAT TAGAGCTAGT 150 

TGAGATGGAA GTTCGTGAGC TTTTAGATCA GTATGAGTTC CCAGGTGATG 200

ACACTCCAGT TATTATGGGT TCAGCTCTTA GAGCTATTGA AGGTGACGAA 250 

GCTTACGTTG AGAAAATTGT TGAGCTAGTT CAAGCTATGG ATGACTATAT 300 

TCCTGCTCCT GAGCGTGATA CTGAGAAGCC ATTTATTCTT CCGATCGAAG 350 

ATGTATTCTC AATTTCAGGT CGTGGTACTG TTGTAACTGG TCGTATTGAG 400 

CGCGGTGTAG TTAACGTTGG TGATGAAGTT GAAGTTGTTG GTATTCGTCC 450

AACTCAAAAA ACTACAGTAA CTGGTGTGGA AATGTTCCGT AAGCTTTTAG 500 

ATAGAGGGGA AGCTGGTGAT AACGTTGGTA TCCTAGTTCG TGGACTTAAG 550 

AGAGATGATG TTGAGCGTGG ACAAGTATTA TGTAAGCCAG GTTCAATTAA 600 

GCCACATACT AAGTTTGAAG CTGAGGTTTA TGTATTATCT AAAGAAGAGG 650 

GTGGTAGACA TACTCCATTC TTCAAGGGAT ATAGACCACA ATTCTACTTC 700

CGTACTACAG ACATTACTGG AGCTGTTGAG CTTCCAGAGG GTGTAGAAAT 750 

GGTTATGCCT GGTGATAACG TTAAGATGAC TATCACTCTA ATTAACCCAA 800 

TCGCTAGGAT GAAGGGTTAC GTTTTGCA 828 



2) INFORMATION FOR SEQ ID NO: 86

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium nucleatum subsp. polymorphum 

(B) STRAIN: ATCC 10953 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 



CGGTGCTATC ATCGTAGKTG CTGCTACTGA TGGTCCGATG CCTCARACTC 50 

GTGAGCAYAT CYTGCTGGCT CGTCAGGTAA ACGTWCCKAG ACTGGTTGTA 100 

TTCATGAACA AGTGYGACAT GGTAGACGAC GCTGAAATGY TGGAACTCGT 150

TGAAATGGAA ATGCGTGAAC TGCTTTCAGC YTACGAATTC GAYGGYGACA 200 

ACACTCCKTT CATTCAGGGT TCTGCTCTTG GTGCRTTGAA YGGCGTTGAA 250 

AAGTGGGAAG AGAAGGTTAT GGANCTGATG GATGCTTGCG ACACTTGGAT 300 

TCCTTTGCCT CCGCGTGATA TTGAYAAACC GTTCTTGATG CCGGTTGAAG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TAGCTACTGG TCGTATCGAA 400

GCTGGTGTTA TCCATGTAGG TGACGAAGTT GAAATCCTCG GTTTGGGTGA 450 

AGACAAGAAG TCTGTTGTAA CTGGTGTTGA AATGTTCCGC AAGTTGCTGG 500 

ATCAAGGTGA AGCTGGTGAC AACGTAGGTY TGTTGCTCCG TGGTATCGAC 550 

AAGAACGAAA TCAAACGTGG TATGGTTCTT TGTAAGCCCG GTCAGATTAA 600 

ACCTCACTCT AAGTTCAAAG CTTCTATCTA CGTTTTGAAG AAAGAAGAAG 650

GTGGTCGTCA CACTCCGTTC CACAACAAAT ACCGTCCTCA GTTCTATCTG 700 

CGTACTATGG ACTGTACAGG TGAAATCWCT CTTCCGGAAG GAACTGAAAT 750 

GGTAATGCCT GGTGATAACG TAGAAATCAC TGTAGAACTG ATCTACCCGG 800 

TAGCATTGAA CGTAGGTTTG CGTTTCGCT 829



2) INFORMATION FOR SEQ ID NO: 87 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Gemella haemolysans
(B) STRAIN: ATCC 10379

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 

CTATCTTAGT AATCGCTGCT ACAGATGGAC CAATGGCTCA AACTCGTGAG 50 

CACATCCTAT TATCTCGTAA CGTTGGAGTA CCAAAAATCG TTGTATTCTT 100

AAACAAATGT GATATGGTTG ATGACGAAGA GTTATTAGAA TTAGTTGAAA 150 

TGGAAGTTCG TGAACTATTA TCTGAATACG GATTCGACGG AGATGAACTA 200 

CCAGTAATCA AAGGTTCTGC TCTTAAAGCT CTTGAAGGAG ATGCAGATGC 250 

AGAAAAAGCT ATCATCGAAT TAATGGAAAC AGTTGACGAA TACATCCCAA 300 

CTCCAGAACG TGATAACGCT AAACCATTCA TGATGCCAGT TGAGGACGTA 350

TTCTCAATCA CAGGTCGTGG TACAGTTGCT ACTGGACGTG TTGAACGTGG 400

ACAAGTTAAA GTTGGAGACG TAGTAGAAAT CGTTGGATTA ACTGAAGAAC 450

CAGCTTCAAC TACTGTAACA GGTGTTGAAA TGTTCCGTAA ATTATTAGAT 500

TACGCTGAAG CAGGAGATAA CATCGGTGCA TTATTACGTG GTGTTGCTCG 550

TGAAGACATC GAACGTGGAC AAGTTTTAGC AGCTCCTAAA ACAATCACTC 600

CACACACTCA ATTCGTAGCT GACGTGTACG TATTATCTAA AGAAGAAGGT 650

GGACGTCACA CTCCATTCTT CACAAACTAC CGTCCTCAAT TCTACTTCCG 700

TACTACTGAC GTAACTGGTG TAGTTACTTT ACCAGAAGGT ACTGAAATGG 750

TAATGCCTGG GGATAACGTA TCAATCAACG TAGAACTTAT TTCTCCAATC 800

GCGATCGAAG AAGGAACTCG TTTCTCAA 828



2) INFORMATION FOR SEQ ID NO: 88 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 823 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella morbillorum 

(B) STRAIN: ATCC 27824 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 

TCTTAGTAAT CGCTGCTACA GATGGTCCTA TGGCTCAAAC TCGTGAACAC 50 

ATCCTATTAT CTCGTAACGT TGGAGTACCT AAAATTGTTG TATTCTTAAA 100 

CAAATGTGAT ATGGTTGATG ACGAAGAGTT ATTAGAATTA GTAGAAATGG 150 

AAGTTCGTGA ACTATTATCT GAATACGGAT TTGATGGAGA TGAACTACCA 200 

GTAATCAAAG GTTCAGCTCT TAAAGCTCTT GAAGGAGATG CAGATGCTGA 250 

AAAAGCTATC ATCGAATTAA TGGAAACAGT TGACGAGTAC ATCCCAACTC 300 

CAGAACGTGA TAACGCTAAA CCATTTATGA TGCCAGTTGA GGACGTGTTC 350 

TCAATCACAG GTCGTGGTAC AGTTGCTACT GGACGTGTTG AACGTGGACA 400

AGTTAAAGTT GGTGACGTAG TAGAAATCGT TGGATTAACT GAAGAACCAG 450 

CTTCAACTAC TGTAACAGGT GTTGAAATGT TCCGTAAATT ATTAGATTAC 500 

GCTGAAGCAG GAGATAACAT CGGTGCATTA TTACGTGGTG TTGCTCGTGA 550 

AGATATCGAA CGTGGACAAG TTTTAGCAGC TCCTAAAACA ATCACTCCAC 600 

ATACTCAATT CGTAGCTGAT GTGTACGTAT TATCTAAAGA AGAAGGTGGA 650 

CGTCACACTC CATTCTTCAC AAACTACCGT CCACAATTCT ACTTCCGTAC 700 

TACTGACGTA ACTGGTGTAG TTACTTTACC AGAAGGTACT GAAATGGTAA 750

TGCCTGGGGA CAACGTATCA ATCAACGTAG AACTTATTTC TCCAATCGCT 800

ATCGAAGAAG GAACTCGTTT CTC 823 



2) INFORMATION FOR SEQ ID NO: 89 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus actinomycetemcomitans

(B) STRAIN: ATCC 33384 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 


GCTATCTTAG TAGTAGCAGC AACCGACGGT CCTATGCCAC AAACTCGTGA 50 

GCACATCTTA TTAGGTCGCC AAGTAGGTGT TCCTTACATC ATCGTATTCT 100 

TAAACAAATG CGACATGGTA GATGACGAAG AGTTATTAGA ATTAGTTGAA 150 

ATGGAAGTTC GTGAACTTCT TTCTCAATAT GACTTCCCGG GCGATGACAC 200 

CCCAATCGTA CGCGGTTCTG CATTAAAAGC GCTTGAAGGC GATGCCGCAT 250

GGGAAGAAAA AATCCTTGAA TTAGCAAACC ATTTAGATAC TTACATCCCG 300 

GAACCTGAGC GTGCTATCGA CCAACCGTTC CTTCTTCCAA TTGAAGATGT 350 

GTTCTCTATC TCCGGTCGTG GTACCGTAGT AACGGGTCGT GTTGAGCGCG 400

GTATCATCCG TACCGGTGAT GAAGTTGAAA TCGTGGGTAT CAAACCGACT 450 

GCAAAAACCA CCGTAACCGG TGTTGAAATG TTCCGTAAAT TACTTGACGA 500

AGGTCGTGCG GGTGAAAACA TCGGTGCATT ATTGCGTGGT ACTAAACGTG 550 

AAGAAATCGA ACGTGGTCAG GTATTGGCGA AACCGGGGTC AATCACCCCG 600 

CACACTGACT TCGAATCTGA AGTGTACGTA TTGTCCAAAG AAGAAGGTGG 650 

TCGTCATACT CCATTCTTCA AAGGTTACCG TCCACAATTC TATTTCCGTA 700 

CAACTGACGT AACCGGTACT ATCGAGTTAC CTGAAGGCGT GGAAATGGTT 750

ATGCCTGGCG ATAACATCAA AATGACCGTA TCCTTAATTC ACCCAATTGC 800 

GATGGACCAA GGTTTACGTT TCGCTATCG 829 






2) INFORMATION FOR SEQ ID NO: 90 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 833 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus aphrophilus 

(B) STRAIN: ATCC 33389 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90

tggtg"ctatc~ 'ttagtagtag "cagc^ctgaT tggtcctatg" C C A C AAA C T C~ b U 

GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGTTCCTTA CATCATCGTA 100 

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGACTTC CCGGGTGATG 200

ATACACCAAT CGTACGTGGT TCTGCATTAC AAGCGTTAAA CGGCGTTGCA 250 

GAATGGGAAG AAAAAATCCT TGAATTAGCA AACCACTTAG ATACTTACAT 300 

TCCTGAGCCA CAACGTGCTA TCGACCAACC GTTCCTTCTT CCAATTGAAG 350 

ACGTGTTCTC TATCTCCGGT CGTGGTACTG TAGTAACAGG TCGTGTTGAG 400 

CGTGGTATCA TCCGTACCGG TGATGAAGTT GAAATCGTAG GTATCAAACC 450

GACTGCGAAA ACTACCGTAA CCGGTGTTGA AATGTTCCGT AAATTACTTG 500 

ACGAAGGTCG TGCAGGTGAA AACATCGGTG CATTATTACG TGGCACTAAA 550 

CGTGAAGAAA TCGAACGTGG TCAAGTATTG GCTAAACCGG GCTCAATCAC 600 

TCCGCACACT GATTTCGAAT CTGAAGTGTA CGTATTATCC AAAGAAGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA ATTCTATTTC 700

CGTACAACTG ACGTAACCGG TACTATCGAG TTACCGGAAG GCGTGGAAAT 750 

GGTTATGCCT GGCGATAACA TCAAAATGAC TGTATCCTTA ATCCACCCAA 800 

TCGCGATGGA CCAAGGTTTA CGTTTCGCTA TCG 833 



2) INFORMATION FOR SEQ ID NO: 91

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 815 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus ducreyi 

(B) STRAIN: DSM 8925 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 



CGGCGCTATC TTAGTTGTAG CAGCAACTGA TGGTCCTATG CCTCAAACTC 50 

GTGAACACAT CTTATTAGGC CGCCAAGTTG GTGTTCCTTA CATCATCGTA 100 

TTCTTAAATA AATGCGATAT GGTAGATGAT GAAGAATTAT TAGAATTAGT 150

TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGATTTC CCAGGTGACG 200

ATACTCCTAT CGTTCGTGGT TCAGCATTAC AAGCATTAAA TGGTGTGCCT 250

GAGTGGGAAG AAAAAATCAT TGAATTAGCA CAACACTTAG ATTCTTATAT 300

CCCTGAGCCT GAGCGTGCGA TTGATAAACC TTTCTTATTA CCAATCGAAG 350

ACGTATTCTC AATTTCAGGT CGTGGTACAG TAGTAACCGG TCGTGTTGAG 400

CGTGGTATCA TCAAATCAGG TGAAGAAGTT GAAATCGTAG GGATTAAAGA 450

AACGACAAAA ACAACAGTAA CCGGTGTTGA GATGTTCCGT AAACTATTAG 500 

ACGAAGGTCG TGCGGGTGAA AACGTAGGTG CCTTATTACG TGGTACTAAA 550 

CGTGAAGAAA TCGAACGTGG TCAAGTATTA GCGAAACCAG GTACAATTAC 600 

ACCACACACT GATTTTGAAT CAGAAGTTTA TGTATTATCA AAAGAAGAAG 650

GTGGTCGTCA TACTCCATTC TTCAAAGGTT ATCGTCCTCA GTTCTACTTC 700

CGYACAACGG ACGTAACAGG AACGATTGAA TTACCTGAAG ATGTTGAGAT 750 

GGTAATGCCT GGTGATAATA TCAAGATGAC AGTAAGCTTA ATTCACCCTA 800 

TCGCGATGGA CGAAG 815 




2) INFORMATION FOR SEQ ID NO: 92 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Haemophilus haemolyticus 

(B) STRAIN: ATCC 33390 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 

TGGTGCTATC TTAGTAGTAG CAGCAACTGA TGGTCCAATG CCACAAACTC 50 

GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGTTCCATA CATCATCGTA 100 

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGACTTC CCAGGTGACG 200 

ATACACCAAT CGTACGTGGT TCTGCATTAC AAGCATTAAA TGGCGTAGCA 250 

GAATGGGAAG AAAAAATCCT TGAGTTAGCA AACCACTTAG ATACTTACAT 300 

CCCAGAACCA GAGCGTGCAA TTGACCAACC GTTCCTTCTT CCAATCGAAG 350 

ATGTGTTCTC AATCTCAGGT CGTGGTACAG TAGTAACTGG TCGTGTAGAA 400

CGTGGTATCA TCCGTACTGG TGATGAAGTA GAAATCGTAG GTATCAAAGA 450

TACAGCAAAA ACTACTGTAA CGGGTGTTGA AATGTTCCGT AAATTACTTG 500

ACGAAGGTCG TGCAGGTGAA AACATCGGTG CATTATTACG TGGTACCAAA 550

CGTGAAGAAA TCGAACGTGG TCAAGTATTA GCGAAACCAG GTTCAATCAC 600

GCCACACACT GACTTCGAAT CAGAAGTTTA TGTATTATCA AAAGAAGAAG 650

GTGGTCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA ATTCTATTTC 700

CGTACAACTG ACGTAACTGG TACTATCGAG TTACCAGAAG GCGTAGAAAT 750

GGTAATGCCA GGCGATAACA TCAAGATGAC AGTAAGCTTA ATCCACCCAA 800

TCGCGATGGA CCAAGGTTTA CGTTTCGCAA 830



2) INFORMATION FOR SEQ ID NO: 93 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Haemophilus parahaemolyticus

(B) STRAIN: ATCC 10014 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93

TCTTAGTAGT AGCAGCAACA GACGGTCCAA TGCCACAAAC TCGTGAGCAC 50

ATCTTATTAG GTCGCCAAGT AGGTGTTCCA TACATCATCG TATTCTTAAA 100 

CAAATGCGAT ATGGTTGACG ATGAAGAATT ATTAGAATTA GTTGAAATGG 150 

AAGTGCGTGA ACTTCTTTCA CAATATGACT TCCCAGGTGA TGACACGCCA 200 

GTAGTACGTG GTTCAGCGTT ACAAGCGTTA AACGGCGTAG CAGAGTGGGA 250 

AGAAAAAATT CTTGAATTAG CAAACCACTT AGATACATAC ATCCCAGAGC 300

CAGAGCGTGC GATTGATAAA CCATTCTTAT TACCAATCGA AGACGTATTC 350 

TCAATCTCAG GTCGTGGTAC AGTAGTAACA GGTCGTGTTG AGCGTGGTAT 400 

CATCAAAGCG GGTGAAGAAG TTGAAATCGT AGGTATCAAA GACACTGCGA 450 

AAACAACAGT AACTGGCGTG GAAATGTTCC GTAAATTATT AGACGAAGGT 500 

CGTGCGGGTG AAAACGTTGG TGCATTATTA CGTGGTACAA AACGTGAAGA 550

AATCGAACGT GGTCAAGTGT TAGCGAAACC AGGTACAATT ACACCACACA 600 

CAGACTTCGA ATCAGAAGTG TACGTATTAT CAAAAGAAGA AGGTGGTCGT 650 

CACACTCCAT TCTTCAAAGG TTACCGTCCA CAATTCTACT TCCGTACAAC 700 

TGACGTAACT GGTACTATTG AATTACCAGA AGGCGTAGAA ATGGTAATGC 750 

CAGGCGATAA CATCAAAATG ACAGTATCAT TAATCCACCC AATCGCGATG 800

GACGAAGGTT TACGTTTTGC GATT 824 



2) INFORMATION FOR SEQ ID NO: 94

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Haemophilus parainfluenzae

(B) STRAIN: ATCC 7901

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 


TGGTGCTATC TTAGTAGTAG CAGCAACTGA TGGTCCTATG CCACAAACTC 50 

GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGTTCCTTA CATCATCGTG 100 

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGACTTC CCGGGTGACG 200

ATACACCAAT CGTACGTGGT TCTGCATTAC AAGCGTTAAA CGGCGTWGCA 250

GAATGGGAAG AAAAAATCCT TGAATTAGCT AGCCACTTAG ATTCTTACAT 300

TCCTGAGCCT CAACGTGCTA TCGACCAACC GTTCCTTCTT CCAATCGAAG 350

ACGTGTTCTC TATCTCCGGT CGTGGTACAG TAGTAACAGG TCGTGTTGAG 400

CGTGGTATCA TCCGTACCGG TGATGAAGTT GAAATCGTAG GTATCAAACC 450

GACTGCGAAA ACTACCGTAA CCGGTGTTGA AATGTTCCGT AAATTACTTG 500

ACGAAGGTCG TGCAGGTGAA AACATCGGTG CATTATTACG TGGTACYAAA 550 

CGTGAAGAAA TCGAACGTGG TCAAGTATTG GCTAAACCGG GTTCAATCAC 600 

TCCACACACT GATTTCGAAT CTGAAGTGTA CGTATTATCC AAAGAAGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA ATTCTATTTC 700 

CGTACAACTG ACGTAACCGG AACTATCGAA TTACCGGAAG GCGTGGAAAT 750

GGTTATGCCT GGTGATAACA TCAAAATGAC TGTATCCTTA ATCCACCCAA 800 

TCGCGATGGA CCAAGGTTTA CGTTTCGCTA TCG 833 






2) INFORMATION FOR SEQ ID NO: 95 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 824 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus paraphrophilus 

(B) STRAIN: ATCC 29241

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95

TGGTGCTATC TTAGTAGTAG CAGCAACTGA TGGTCCTATG CCACAAACTC 50

GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGTTCCTTA CATCATCGTA 100 

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGACTTC CCGGGTGACG 200

ATACGCCAAT CGTACGTGGT TCTGCATTAC AAGCGTTAAA CGGCGTTGCA 250 

GAATGGGAAG AAAAAATCCT TGAATTAGCA AACCACTTGG ATACTTACAT 300 

TCCTGAGCCA CAACGTGCTA TCGACCAACC GTTCCTTCTT CCAATCGAAG 350

ACGTGTTCTC TATCTCCGGT CGTGGTACAG TAGTAACAGG TCGTGTTGAG 400

CGTGGTATCA TCCGTACCGG TGATGAAGTT GAAATCGTAG GTATCAAACC 450

GACTGCGAAA ACTACCGTAA CCGGTGTTGA AATGTTCCGT AAATTACTTG 500 

ACGAAGGTCG TGCAGGTGAA AACATCGGTG CATTATTACG TGGTACCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAAGTATTG GCTAAACCGG GTTCAATCAC 600 

TCCACACACT GATTTCGAAT CTGAAGTGTA CGTATTATCC AAAGAAGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA ATTCTATTTC 700

CGTACAACTG ACGTAACCGG TACTATCGAG TTACCGGAAG GTGTGGAAAT 750 

GGTAATGCCT GGCGATAACA TCAAAATGAC CGTATCCTTA ATCCACCCAA 800 

TCGCGATGGA CCAAGGTTTA CGTT 824



2) INFORMATION FOR SEQ ID NO: 96

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 818 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus segnis 

(B) STRAIN: ATCC 33393 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 



GCTATCTTAG TAGTAGCAGC AACTGATGGT CCTATGCCAC AAACTCGTGA 50 

GCACATCTTA TTAGGTCGCC AAGTAGGTGT TCCTTACATC ATCGTATTCT 100 

TAAACAAATG CGACATGGTA GATGACGAAG AGTTATTAGA ATTAGTTGAA 150

ATGGAAGTTC GTGAACTTCT TTCTCAATAT GACTTCCCAG GTGATGATAC 200

TCCAATCATT CGTGGTTCTG CATTACAAGC GTTAAACGGC GTAGCAGAAT 250

GGGAAGAAAA AATCCTTGAA TTAGCTCAAG CATTAGATAC TTACATTCCT 300 

GAACCTGAGC GTGCAATCGA CCAACCGTTC CTTCTTCCAA TTGAAGACGT 350

GTTCTCAATC TCTGGTCGTG GTACTGTAGT AACAGGTCGT GTAGAGCGTG 400

GTATCATCCG TACCGGTGAT GAAGTTGAAA TCGTTGGTAT CAAACCAACT 450

GCGAAAACAA CCGTAACCGG TGTTGAAATG TTCCGTAAAT TACTTGACGA 500 

AGGTCGTGCA GGTGAAAACA TCGGTGCATT ATTACGTGGT ACTAAACGTG 550 

AAGAAATCGA ACGTGGTCAA GTATTAGCGA AACCGGGTTC AATCACTCCA 600 

CACACTGACT TCGAATCTGA AGTGTACGTA TTATCTAAAG AAGAAGGTGG 650

TCGTCATACT CCATTCTTCA AAGGTTACCG TCCACAATTC TATTTCCGTA 700 

CAACTGACGT AACCGGTACT ATCGAGTTAC CGGAAGGCGT GGAAATGGTT 750 

ATGCCTGGCG ATAACATCAA AATGACCGTA TCCTTAATCC ACCCAATCGC 800 

GATGGACCAA GGTTTACG 818 




2) INFORMATION FOR SEQ ID NO: 97 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 763 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Hafnia alvei

(B) STRAIN: ATCC 13337

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 



CGGCGCTATC CTGGTTGTTG CTGCGACTGA CGGCCCTATG CCTCAGACTC 50

GTGAGCACAT CCTGCTGGGT CGTCAGGTTG GCGTTCCTTW CATCATCGTR 100

TTCCTGAACA AATGCGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 150

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCWGGYGATG 200

ATACTCCAAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGYGMACCT 250

GAGTGGGAAG CTAAGATCGT AGAACTGGCT GAAACTCTGG ATTCTTACAT 300

YCCACARCCA GAACGTGCTA TCGAYAAGCC ATTCCTGCTG CCAATCGAAG 350

ACGTATTCTC TATCTCTGGC CGTGGTACWG TTGTTACCGG TCGTGTAGAG 400

CGCGGTATCG TTAAAGTTGG TGAAGAAGTT GARATCGTTG GTATCAAAGA 450

TACCGTTAAA TCAACTTGTA CCGGCGTTGA AATGTTCCGT AAACTGCTGG 500

ACGAAGGTCG TGCAGGCGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAG 550

CGTGAAGACA TCGAACGTGG TCAGGTTCTG GCTAAACCAG GYTCYATCAA 600

GCCACACACC AAGTTCGAAT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650

GCGGYCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700

CGTACAACTG ACGTGACCGG TACCATCGAA TTGCCAGAAG GCGTGGAAAT 750

GGTAATGCCA GGC 763



2) INFORMATION FOR SEQ ID NO: 98 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 828 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kingella kingae 

(B) STRAIN: ATCC 23330 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98 



CGGCGCAATC TTGGTATGTT CAGCAGCTGA CGGTCCTATG CCACAAACTC 50

GCGAACACAT CTTGTTGGCT CGCCAAGTAG GTGTACCTTA TATCATCGTA 100

TTCATGAACA AATGCGACAT GGTCGATGAT GCTGAGTTGT TGGAATTGGT 150

TGAAATGGAA ATCCGTGACT TGTTGTCTAG CTACGATTTT CCAGGCGACG 200

ATTGCCCAAT CGTTCAAGGT TCTGCATTGC GYGCATTGGA AGGCGACGCT 250

GCATACAAAG AAAAAATCTT TGAATTGGCT GCTGCTTTGG ATAGCTACAT 300

TCCTACTCCA GAACGTGCTG TTGATAAACC ATTCTTGTTG CCAATCGAAG 350

ATGTATTCTC TATCTCTGGT CGTGGTACAG TAGTTACTGG TCGTGTAGAG 400

CGCGGTATCA TCAAAGTAGG CGAAGAGATT GAAATCGTTG GTTTGAAAGA 450

CACGCAAAAA ACCACTTGTA CTGGCGTGGA AATGTTCCGC AAATTGTTGG 500

ACGAAGGTCA AGCTGGTGAT AACGTTGGTG TATTATTGCG TGGTACGAAG 550

CGTGAAGACG TTGAACGTGG TCAGGTATTG GCTAAACCAG GTTCTATCAC 600

TCCGCACACT AAATTTGAAG CTGAAGTGTA TGTGTTGAGC AAAGAAGAAG 650

GTGGCCGTCA TACGCCATTC TTCGCTAALT ACOjCCCACA ATTCTACTTC 700

CGTACGACTG ACGTAACTGG TGCAGTTACT TTGTCTGAGG GTGTGGAAAT 750

GGTTATGCCA GGCGAAAACG TGAAAATCAC TGTTGAGTTG ATTGCACCTA 800

TCGCTAGGAA AACGGTTTGC GTTTTGCG 828



2) INFORMATION FOR SEQ ID NO: 99 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Klebsiella ornithinolytica

(B) STRAIN: ATCC 31898



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99 



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 50

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAGCTGGTT 150

GAAATGGAAG TCCGTGAGCT GCTGTCTCAG TACGACTTCC CGGGCGACGA 200

CACGCCGATC GTTCGTGGTT CCGCTCTGAA AGCGCTGGAA GGCGAAGCAG 250

ACTGGGAAGC GAAAATCATC GAACTGGCTG GCTACCTGGA TTCTTACATC 300

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400

GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450

ACTGCGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCTATCAAG 600

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750

GTCATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TTCACCCGAT 800

CGCGATGGAC GATGGTCTGC GTTTCGCA 828



2) INFORMATION FOR SEQ ID NO: 100 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 749 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 

GATGCCGCAG ACTCGTGAGC ACATCCTGCT GGGTCGTCAG GTAGGCGTTC 50

CGTACATCAT CGTGTTCCTG j^CAAGTGCG ACaYgGTTGaT ~TGAC~g1^GAG~ luV " 

CTGCTGGAAC TGGTTGAAAT GGAAGTTCGT GAACTTCTGT CTCAGTACGA 150 

TTTCCCGGGC GACGACACTC CGATCGTTCG TGGTTCTGCT CTGAAAGCGC 200 

TGGAAGGCGA CGCWGAGTGG GAAKCKAAAA TCATCGAACT GGCTGGCTTC 250 

CTGGATTCTT ACATTCCGGA ACCAGAGCGT GCGATTGACA AGCCGTTCCT 300 

GCTGCCGATC GAAGACGTAT TCTCCATCTC CGGTCGTGGT ACCGTTGTTA 350 

CCGGTCGTGT AGAGCGCGGT ATCATCAAAG TTGGCGAAGA AGTTGAAATY 400

GTTGGTATYA AAGACACTGC TAAGTCTACC TGTACTGGCG TTGAAATGTT 450 

CCGCAAACTG CTGGACGAAG GCCGYGCTGG TGAGAACGTT GGTGTTCTGC 500 

TGCGTGGTAT CAAACGTGAA GAAATCGAAC GTGGTCAGGT ACTGGCTAAG 550 

CCGGGCTCTA TCAAGCCGCA CACCAAGTTC GAATCTGAAG TTTATATCCT 600 

GTCCAAAGAC GAAGGCGGCC GTCACACTCC GTTCTTCAAA GGCTACCGTC 650 

CGCAGTTCTA CTTCCGTACA ACTGACGTGA CTGGCACCAT CGAACTGCCG 700 

GAAGGCGTAG AGATGGTTAT GCCGGGCGAC AACATCAAAA TGGTTGTTA 749 



2) INFORMATION FOR SEQ ID NO: 101 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Klebsiella planticola

(B) STRAIN: ATCC 33531 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 

TGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAGC TGCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCCGCTCTGA AAGCGCTGGA AGGCGAAGCA 250 

GACTGGGAAG CGAAAATCAT CGAACTGGCT GGCTACCTGG ATTCTTACAT 300

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCTATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

TACTGCTAAR TCWACCTGTA CYGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCTCTATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATTCACCCRA 800

TCGCGATGGA CGACGGTCTG CGTTTCGCAA 830 



2) INFORMATION FOR SEQ ID NO: 102

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 

(B) STRAIN: ATCC 11296 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 



CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TTGAGATGGA 150 

AGTTCGTGAA CTGCTGTCTC AGTACGATTT CCCGGGCGAC GACACCCCGA 200 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250

GCGAAAATCA TCGAACTGGC TGGCCACCTG GATACCTATA TCCCGGAACC 300 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 400

ATCAAAGTAG GTGAAGAAGT TGAAATCGTT GGTATCAAAG AAACCGCGAA 450 

AACCACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500

GTGCTGGTGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCACCATCA ACCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ACATCCTGTC CAAAGACGAA GGCGGCCGTC 650 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACTACT 700 

GACGTGACTG GCACCATCGA ACTGCCGGAA GGCGTAGAGA TGGTAATGCC 750 

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 

ACGACG 806 






2) INFORMATION FOR SEQ ID NO: 103 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 743 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae

(B) STRAIN: ATCC 13883 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103

GCAGACTCGT GAGCACATCC TGCTGGGTCG TCAGGTAGGC GTTCCGTACA 50 

TCATCGTGTT CCTGAACAAA TGCGACATGG TTGATGACGA AGAGCTGCTG 100 

GAACTGGTTG AGATGGAAGT TCGTGAACTG CTGTCTCAGT ACGATTTCCC 150 

GGGCGACGAC ACTCCGATCG TTCGTGGTTC TGCTCTGAAA GCGCTGGAAG 200

GCGACGCAGA GTGGGAAGCG AAAATCATCG AACTGGCTGG CCACCTGGAT 250 

ACCTATATCC CGGAACCAGA GCGTGCGATT GACAAGCCGT TCCTGCTGCC 300

GATCGAAGAC GTATTCTCCA TCTCCGGTCG TGGTACCGTT GTTACCGGTC 350

GTGTAGAGCG CGGTATCATC AAAGTAGGTG AAGAAGTTGA AATCGTTGGT 400

ATCAAAGAAA CCGCGAAAAC CACCTGTACT GGCGTTGAAA TGTTCCGCAA 450

ACTGCTGGAC GAAGGCCGTG CTGGTGAGAA CGTAGGTGTT CTGCTGCGTG 500 

GTATCAAACG TGAAGAAATC GAACGTGGTC AGGTACTGGC TAAGCCGGGC 550 

ACCATCAACC CGCACACCAA GTTCGAATCT GAAGTGTACA TCCTGTCCAA 600 

AGACGAAGGC GGCCGTCACA CTCCGTTCTT CAAAGGCTAC CGTCCGCAGT 650 

TCTACTTCCG TACTACTGAC GTGACTGGCA CCATCGAACT GCCGGAAGGC 700

GTAGAGATGG TAATGCCGGG CGACAACATC AAAATGGTTG TTA 743 



2) INFORMATION FOR SEQ ID NO: 104

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Klebsiella pneumoniae subsp. rhinoscleromatis

(B) STRAIN: ATCC 13884

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104

TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG TGAGCACATC 50

CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA 100

ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT GAGATGGAAG 150

TTCGTGAACT RCTGTCTCAG TACGATTTCC CGGGCGACGA CACCCCGATC 200

GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 250

GAAAATCATC GAACTGGCTG GCCACCTGGA TACCTATATC CCGGAACCAG 300

AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 350

ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC GCGGTATCAT 400

CAAAGTAGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAA ACCGCGAAAA 450

CCACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 500

GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 550

CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAC CCGCACACCA 600

AGTTCGAATC TGAAGTGTAC ATCCTGTCCA AAGACGAAGG CGGCCGTCAC 650

ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA 700

CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 750

GCGACAACAT CAAAATGGTT GTTACCCTGA TCCATCCGAT CGCGATGGAC 800

GACGGTCTGC GTTTCGCAA 819



2) INFORMATION FOR SEQ ID NO: 105 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorbata 

(B) STRAIN: ATCC 33433 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA TGGCCCTATG CCACAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AATGYGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGCGACG 200 

ATACTCCAAT CATCCGTC^^^ ^ blf ~ 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

CCCAGAACCA GAACGTGCTA TCGATAAGCC GTTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTTGG YGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

CACCGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCTAAGCCAG GCTCTATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACTACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTTGAGAT 750 

GGTAATGCCA GGCGACAACA TCAAGATGGT TGTGACTCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGCCTG CGTTTCGCAA CC 832 



2) INFORMATION FOR SEQ ID NO: 106 
(i) SEQUENCE CHARACTERISTICS:



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150



(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera cryocrescens 
(B) STRAIN: ATCC 33435

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106 

TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCTATG CCACAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100

TTCCTGAACA AATGTGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGCGACG 200

ACACTCCTAT CGTTCGTGGT TCCGCGCTGA AAGCGCTGGA AGGCGACGCT 250

GAGTGGGAAG CAAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300

CCCAGAACCA GAGCGTGCGA TTGATAAGCC GTTCCTGCTG CCAATCGAAG 350

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTGG GTATCAAAGA 450

CACTGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCTAAGCCAG GCTCCATCAA 600

GCCGCACACC AAATTCGAAT CTGAAGTTTA CATCCTGTCC AAAGACGAAG 650

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA 830



2) INFORMATION FOR SEQ ID NO: 107 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Kluyvera georgiana

(B) STRAIN: ATCC 51603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 

CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACTCGTG 50

AGCACATCCT GCTGGGTCGT CAGGTTGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTTGA 150 

AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGCGACGACA 200 

CGCCGATCGT TCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCTGAG 250

TGGGAAGCGA AAATCATCGA ACTGGCGGGC TTCCTGGATT CTTACATCCC 300

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 350

TATTCTCCAT CTCCGGTCGT GGTACCGTTG TTACCGGTCG TGTAGAACGC 400

GGTATCATCA AAGTTGGCGA AGAAGTTGAA ATCGTTGGTA TCAAAGACAC 450

CGCTAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAACGT 550

GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGTT CTATCAAGCC 600

GCACACCAAG TTCGAATCTG AAGTGTACAT TCTGTCCAAA GACGAAGGCG 650

GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCGCAGTT CTACTTCCGT 700

ACTACTGACG TGACTGGCAC CATCGAACTG CCGGAAGGCG TTGAGATGGT 750

AATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CACCCGATCG 800

CGAAGGACGA AGGTCTGCGT TTCGCA 826



2) INFORMATION FOR SEQ ID NO: 108 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactobacillus casei subsp. casei 

(B) STRAIN: ATCC 393

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108

GCTGCTGATG GCCCAATGCC ACAAACTCGT GAACATATCT TACTTTCACG 50 

TCAAGTTGGT GTTCCATACA TCGTTGTATT CATGAACAAA TGTGACATGG 100 

TTGACGATGA AGAATTACTA GAATTAGTTG AAATGGAAAT TCGTGATCTA 150 

TTAACTGAAT ATGAATTCCC TGGCGATGAC ATTCCTGTAA TCAAAGGTTC 200

AGCTCTTAAA GCACTTCAAG GTGAAGCTGA CTGGGAAGCT AAAATTGACG 250 

AGTTAATGGA AGCTGTAGAT TCTTACATTC CAACTCCAGA ACGTGATACT 300 

GACAAACCAT TCATGATGCC AGTTGAGGAT GTATTCTCAA TCACTGGTCG 350 

TGGAACAGTT GCAACTGGAC GTGTTGAACG TGGACAAGTT AAAGTTGGTG 400 

ACGAAGTAGA AGTTATCGGT ATTGAAGAAG AGAGCAAAAA AGTAGTAGTA 450 

ACTGGAGTAG AAATGTTCCG TAAATYACTA GATTACGCTG AAGCTGGCGA 500

CAACATTGGC GCACTTCTAC GTGGTGTTGC TCGTGAAGAT ATCCAACGTG 550 

GTCAAGTATT AGCTAAACCA GGTTCGATTA CTCCACACAC TAACTTCAAA 600 

GCTGAAACTT ATGTTTTAAC TAAAGAAGAA GGTGGACGTC ACACTCCATT 650 

CTTCAACAAC TACCGCCCAC AATTCTATTT CCGTACTACT GACGTAACTG 700 

GTATTGTTAC ACTTCCAGAA GGTACTGAAA TGGTAATGCC TGGTGATAAC 750 

ATTGAGCTTG CAGTTGANCT AATTGCACCA ATCGCTATCG AAGACGGTAC 800 

TAA 803



2) INFORMATION FOR SEQ ID NO: 109 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis subsp. lactis 

(B) STRAIN: ATCC 19435 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109 

CGGTGCAATC CTCGTTGTTG CTGCAACTGA TGGACCAATG CCACAAACTC 50

GTGAACACAT CTTGCTTTCA CGTCAAGTTG GTGTTAAATA CCTTATCGTC 100

TTCCTTAACA AGGCTGACCT TGTTGATGAT GAAGAATTGA TGGAACTCGT 150 

TGAAATGGAA GTTCGTGACC TCTTGAGCGA ATACGACTTC CCAGGTGACG 200

ATATTCCTGT AATCGCTGGT TCAGCACTTG GTGCTTTGAA CGGTGAACCA 250 

CAATGGGTTG CTAAAGTTGA AGAATTGATG GACATCGTTG ATGAATACAT 300 

CCCAACTCCA GAACGCGACA CTGACAAACC ACTCCTTCTT CCAGTCGAAG 350

ACGTATTCTC TATCACTGGT CGTGGTACAG TTGCTTCAGG ACGTATCGAA 400

CGTGGTACTG TTAAAGTTGG TGACGAAGTT GAAATCGTTG GTATCAAAGA 450

AGAAACTAAA AAAGCTGTTG TTACTGGTAT CGAAATGTTC CGTAAAACAC 500 

TTACTGAAGG TCTTGCTGGT GATAACGTCG GTGCACTTCT CCGTGGTATC 550 

CAACGTGACG AAATCGAACG TGGTCAAGTT ATTGCTAAAC CAGGTTCAAT 600 

CACTCCACAC AAACTTTTCG AAGGTGAAGT TTACGTATTG AGCAAAGAAG 650 

AAGGCGGACG TCACACTCCA TTCTTCGACA ACTACCGTCC TCAATTCTAC 700

TTCCACACAA CTGACGTTAC TGGTTCAGTT AAACTTCCAG AAGGAACTGA 750 

AATGGTAATG CCTGGTGACA ACGTGCATAT CGACGTTGAA TTGATCCACC 800 

CAGTTGCGAT CGAACAAGGT ACTAC 825 




2) INFORMATION FOR SEQ ID NO: 110 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 824 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leclercia adecarboxylata 

(B) STRAIN: ATCC 23216 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCAATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCTTTC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150

GAGATGGAAG TTCGTGAACT YCTGTCCCAG TACGACTTCC CGGGCGACGA 200 

CACCCCAATC" GTTCGTGGtT^TG^gUiG^" AGCGCl'GGAA GGCG^GCAIJ " 4 blf ' 

AGTGGGAAGA GAAAATCATC GARCTGGCTG GCTACCTGGA TTCCTACATC 300 

CCAGAGCCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CTATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400

GCGGTATCAT CAARGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAGGAC 450

ACTGCTAAGT CTACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTTCTGG CTAAGCCAGG CTCYATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATCCTGTCYA AAGACGAAGG 650

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACKACTGA CGTGACCGGT ACCATCGARC TGCCAGAAGG CGTTGAGATG 750 

GTAATGCCAG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCAAT 800 

CGCAATGGAC GATGGTCTGC GTTC 824 




2) INFORMATION FOR SEQ ID NO: 111 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 838 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella micdadei 
(B) STRAIN: ATCC 33218

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 

CGGAGCGATA TTAGTAGTAT CAGCAGCGGA TGGCCCAATG CCTCAAACGA 50 

GAGAGCACAT ACTYTTATCC CGSCAGGTAG GTGTTCCCTA TATAGTAGTG 100

TTCTTAAACA AAGCTGACAT GGTGGATGAT GCGGAGTTAT TAGAATTAGT 150 

TGAAATGGAA GTACGCGAYT TGTTGAGCAG CTATGAATTT CCAGGAGATG 200

AGATCCCGAT TGTAGTTGGT TCAGCATTAA AAGCATTGGA AGGCGATACG 250

AGTGATATAG GTGTACCAGC GATTGAGAAG TTAGTTGAGA CGATGGATTC 300

TTATATACCT GAGCCGGTAA GAAACATCGA TAAAAGTTTC TTGTTACCGA 350

TCGAAGACGT GTTCTCAATA TCTGGACGAG GAACAGTAGT AACAGGACGT 400

ATCGAAAGCG GGATCATCAA AGTTGGTGAG GAAGTCGAGA TTGTTGGTAT 450

ACGTGACACT CAAAAGACGA CATGCACAGG CGTTGAAATG TTCCGTAAAT 500 

TACTTGACGA AGGTCGAGCT GGAGACAACG TTGGTATATT GCTACGTGGT 550 

ACGAAGCGGG ATGAAGTTGA ACGCGGACAA GTATTAGCTA AGCCGGGAAG 600

CATTAAACCG CATACTAAAT TTGAAGCTGA AGTGTATGTG TTGTCAAAAG 650 

ATGAAGGTGG ACGTCATACC CCATTCTTTA ACGGATATCG GCCTCAATTT 700 

TACTTCAGGA CCACAGACGT AACTGGTTCT TGTGATTTAC CTGARGGTAT 750 

AGAAATGGTA ATGCCAGGTG ATAACGTCAA GCTGATTGTT AGCTTACACT 800

CACCGATTGC TATGGACGAA GGTTTGCGTT TTGCAATC 838



2) INFORMATION FOR SEQ ID NO: 112 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 838 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Legionella pneumophila subsp. pneumophila

(B) STRAIN: ATCC 33152 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 

CGGAGCGATA CTGGTTGTAT CAGCAGCTGA TGGTCCTATG CCACAAACGA 50

GGGAACACAT TCTATTGTCT CGCCAGGTAG GTGTTCCATA TATTGTTGTG 100 
TTCATGAACA AAGCGGATAT GGTTGATGAC CCTGAGTTAT TAGAGTTAGT 150

GGAAATGGAA GTGCGAGATT TATTAAGCAG TTACGATTTC CCAGGGGATG 200

ACATACCTAT TGTTGTTGGT TCAGCTTTGA AAGCATTGGA AGGTGAAGAC 250

AGTGATATAG GCGTTAAGGC TATTGAGAAA TTGGTTGAAA CAATGGATTC 300

ATACATTCCT GAGCCAGTTA GAAACATAGA CAAGCCATTT TTGTTGCCGA 350

TTGAAGACGT ATTTTCAATT TCTGGACGCG GAACAGTGGT AACTGGTCGT 400 

GTAGAGAGTG GAATTGTTAA AGTTGGTGAG GAAGTTGAAA TTGTTGGAAT 450 

AAGAGACACC CAAAAGACGA CTTGTACGGG TGTTGAGATG TTCCGTAAAT 500 

TACTTGATGA AGGTCGAGCT GGTGATAACG TTGGTGTGTT ATTACGAGGT 550

ACGAAGCGAG ATGAAGTGGA GCGTGGACAG GTATTGGCGA AGCCAGGAAC 600

CATCAAGCCA CACACCAAGT TTGAAGCAGA AGTGTATGTA TTATCCAAGG 650

AAGAAGGCGG ACGTCACACT CCATTCTTTA ATGGATACCG TCCACAATTC 700

TATTTCAGAA CCACTGACGT GACAGGTACT TGTGACTTGC CATCAGGAGT 750

TGAAATGGTA ATGCCTGGAG ATAATGTGCA ATTAGTTGTT AGCTTGCATG 800

CTCCGATTGC GATGGATGAA GGTTTAAGAT TCGCAATT 838



2) INFORMATION FOR SEQ ID NO: 113

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Leminorella grimontii 

(B) STRAIN: ATCC 33999 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 



GTGCAATCCT GGTAGTAGCA GCGACTGACG GCCCGATGCC TCAGACTCGC 50 

GAGCACATCC TGCTGGGTCG TCAGGTAGGC GTTCCGTACA TCATCGTATT 100 

CCTGAACAAG TGCGATATGG TTGATGACGA AGAGCTGCTG GAGCTGGTTG 150 

ARATGGAAGT TCGCGAACTG CTGTCTCAGT ACGACTTCCC GGGCGACGAC 200 

ACTCCGGTAG TCCGCGGTTC AGCGCTGAAA GCGCTGGAAG GCGAAGCCGA 250

GTGGGAARCG AAAATCATCG AGCTGGCAGG CCMTCTGGAT ACTTATATCC 300 

CAGAACCTGA GCGTGCGATT GACAAGCCGT TCCTGCTGCC KATCGAAGAC 350 

GTATTCTCTA TCTCCGGCCG TGGTACCGTT GTTACCGGTC GTGTAGAGCG 400 

CGGCATCATC AAAGTCGGTG AAGAAGTGGA AATCGTCGGT ATCAAAGATA 450 

CCACCAAGAC CACCTGTACC GGCGTTGAAA TGTTCCGTAA GCTGCTGGAC 500

GAAGGCCGTG CGGGCGAGAA CGTGGGCGTT CTGCTGCGCG GTACCAAGCG 550 

TGACGAAATC GAACGTGGTC AAGTTCTGGC CAAGCCGGGC ACCATCACTC 600 

CTCACACCCA GTTCGTGTCA GAAGTGTATA TCCTGAGCAA GGATGAAGGC 650 

GGCCGTCATA CTCCGTTCTT CAAAGGCTAC CGTCCTCAGT TCTACTTCCG 700 

TACGACTGAC GTGACAGGCA CCATCGAACT GCCGGAAGGC GTAGAGATGG 750

TAATGCCAGG CGACAACATT CAGATGACCG TAAGTCTGAT TGCGCCGATC 800 

GCAATGGACG ^GGtcYgCG" CTTCGCAA " a 2 b 






2) INFORMATION FOR SEQ ID NO: 114 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 826 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminorella richardii 

(B) STRAIN: ATCC 33998 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114

GCTATCCTGG TTGTTGCTGC GACTGACGGC CCAATGCCTC AGACTCGTGA 50

GCACATCCTG CTGGGTCGCC AGGTAGGCGT TCCTTACATC ATCGTGTTCC 100 

TGAACAAGTG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTAGAA 150 

ATGGAAGTTC GTGAACTTCT GTCTCAATAC GACTTCCCGG GCGACGATAC 200

GCCGGTTGTT CGCGGTTCAG CGCTGAAAGC GCTGGAAGGT GACGCYGAGT 250 

GGGAARCGAA AATCATTGAA CTGGCGGAAT CCTTRGATAC TTAYATTCCA 300 

GAGCCAGAGC GTGCGATTGA CAAGCCGTTC CTGCTGCCTA TCGAAGACGT 350 

TTTCTCTATC TCTGGCCGTG GTACTGTAGT CACCGGTCGT GTAGAGCGCG 400 

GCATCATCAA AGTTGGTGAA GAAGTGGAAA TCGTGGGAAT CAAAGACACC 450

ACCAAGACCA CCTGTACTGG CGTTGAAATG TTCCGTAAGC TGCTGGACGA 500 

AGGCCGTGCA GGTGAGAACG TTGGTGTTCT GCTGCGYGGT ACTAAGCGTG 550 

ACGAAATCGA ACGTGGTCAG GTACTGGCTA AGCCAGGCAC CATCACTCCT 600 

CACACAGAAT TCGTGTCAGA AGTGTATATC CTGAGCAAGG ATGAAGGCGG 650 

YCGTCATACT CCGTTCTTCA AAGGCTACCG TCCTCAGTTC TACTTCCGTA 700

CGACTGACGT GACCGGCACC ATCGAACTGC CAGAAGGCGT AGAGATGGTA 750 

ATGCCAGGCG ATAACATCCA GATGGTAGTT ACGCTGATTG CCCCAATCGC 800 

GATGGACGAA GGTCTGCGCT TCGCAA 826 






2) INFORMATION FOR SEQ ID NO: 115 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 843 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Leptospira interrogans

(B) STRAIN: ATCC 23581

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 



TGCGGCGATT CTTGTAGTAT CCGCAACTGA CGGACCTATG CCACAAACAA 50 

AAGAACATAT CCTTCTTGCT CGTCAGGTAG GTGTTCCATA TGTAATTGTA 100 

TTCATTAACA AAGCAGATAT GCTTGCTGCT GACGAAAGAG CAGAAATGAT 150

CGAAATGGTT GAGATGGACG TTCGTGAACT TCTCAATAAG TATAGCTTCC 200 

CAGGAGATAC J^CTC^TATC" GTI^TGGtT Cl'b^Gcf l : AAA AGCAL^U^G " 2 Mj" ' 

GGCGATGAAT CTGAAATTGG GATGCCTGCA ATTCTCAAAT TGATGGAAGC 300 

TCTGGATACT TTCGTTCCAA ATCCAAAACG TGTAATCGAC AAACCTTTCC 350 

TTATGCCAGT AGAAGACGTT TTCTCGATCA CTGGTCGTGG AACTGTTGCA 400

ACTGGAAGAG TGGAACAAGG TGTTTTGAAA GTGAACGACG AAGTTGAAAT 450 

TATCGGTATC CGCCCAACAA CAAAAACTGT TGTTACCGGT ATCGAAATGT 500 

TCAGAAAACT TCTCGATCAA GCGGAAGCTG GCGACAACAT CGGCGCTCTT 550 

CTTCGTGGAA CTAAAAAAGA AGAAATCGAA AGAGGGCAAG TTCTTGCGAA 600 

GCCAGGTTCT ATCACTCCTC ACAAAAAGTT TGCCGCTGAG GTGTATGTAT 650

TAACTAAGGA TGAAGGCGGA CGTCATACTC CGTTTATCAA TAACTACCGT 700 

CCTCAGTTTT ACTTTAGAAC AACTGACGTA ACCGGAGTTT GTAACCTTCC 750 

TAATGGTGTC GAAATGGTTA TGCCTGGTGA TAACGTTTCT TTGACGGTTG 800 

AATTGATTAG CCCGATCGCA ATGGACAAGG GTCTTAAGTT CGC 843 




2) INFORMATION FOR SEQ ID NO: 116 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 832 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Megamonas hypermegale 

(B) STRAIN: ATCC 25560 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 

CGGTGCTATC CTCGTTGTTA GTGCTGCTGA TGGTCCTATG CCTCAGACTC 50 

GTGAACACAT CCTTCTCGCT CGTCAGGTTG GTGTTCCAGC TATCGTTGTA 100 

TTCCTCAACA AAGCTGACCA GGTTGATGAC CCTGAACTTC TCGAACTTGT 150 

TGAAATGGAA GTTCGTGAAC TTCTTTCCAG CTATGACTTC CCAGGCGATG 200 

ACGTTCCAGT AATCACTGGT TCCGCTCTTC AGGCTCTCGA AGGCGACGAA 250 

GAAGCTAAAA AGAAAATTCT TGAATTAATG GATGCTGTTG ATGATTACAT 300

CCCAACTCCA ACACGTGACA CTGATAAACC TTTCTTAATG CCAGTTGAAG 350

ACGTATTCAC AATTACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400

CGTGGCGAAC TTAAACTTGG TGACAGCGTT GAAATCGTTG GTCTTTCCGA 450 

TGAAAAGAAA TCCACTACTG TAACTGGTAT CGAAATGTTC CGCAAAATGC 500

TTGATAGCGC TGTTGCTGGT GATAACATCG GTGCACTTCT TCGTGGTATT 550

GACCGTAAAG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CTGGCACAAT 600 

TCATCCACAC AAAAAATTCA AAGCTCAGGT TTACGTATTA ACTAAAGAAG 650

AAGGTGGACG TCATACTCCA TTCTTCTCCA ACTATCGTCC ACAGTTCTAT 700 

TTCCGTACTA CTGACGTTAC TGGTGTTGTA ACTCTTCCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGATA ACATTGAAAT GAGCATCGAA CTCATCACTC 800 

CAATCGCTAT TGAAAAAGGT CTTCGCTTCG CT 832 



2) INFORMATION FOR SEQ ID NO: 117 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 820 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mitsuokella multacida 

(B) STRAIN: ATCC 27723 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117 



TGGTGCTATC CTCGTCGTTT CCGCTGCTGA TGGCCCGATG CCGCAGACGC 50

GTGAGCACAT CCTGCTCGCT CGCCAGGTCG GTGTTCCGGC AATCGTTGTC 100

TTCCTCAACA AGGTTGACCA GGTTGACGAT CCGGAGCTCC TCGAGCTCGT 150

CGAGATGGAA GTTCGCGAGC TGCTCTCCAG CTACGACTTC CCGGGCGATG 200

ACATCCCTGT AATCGCTGGT TCCGCTCTGA AGGCCCTCGA AGGCGACGAA 250

GAGCAGAAGA AGAACATCCT CAAGCTCATG GAAGCTGTCG ATGAGTACAT 300

CCCGACGCCG GTCCGCGACA ACGCTAAGCC GTTCCTGATG CCGGTCGAGG 350

ATGTCTTCAC GATCACGGGC CGTGGTACGG TTGCAACGGG CCGCGTTGAG 400

CGTGGTGAGC TCAAGATGAA CGATACGGTT GAGATCGTTG GTCTGCAGGA 450

CGAGCCGCGT CAGACGGTTG TCACGGGCAT CGAGATGTTC CGCAAGATGC 500

TTGATTTCGC TGAGGCTGGC GATAACATCG GTGCTCTGCT CCGTGGTATC 550

GACCGCAAGG AGATCGAGCG TGGCCAGGTT CTCGCAAAGC CGGGCACGAT 600

TCATCCGCAC ACGAAGTTCA AGGCTCAGGT CTATGTCCTG ACGAAAGAAG 650

AAGGCGGCCG TCATACGCCG TTCTTCACGA ACTATCGCCC GCAGTTCTAC 700

TTCCGCACGA CGGACGTAAC TGGCGTAGTC AAACTGCCGG AAGGCACGGA 750

GATGGTTATG CCTGGCGATA ACGTCGAGAT GGAAGTTGAG CTCATCACCC 800

CGATCGCTAT CGAGAAGGGC 820



2) INFORMATION FOR SEQ ID NO: 118

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Mobiluncus curtisii subsp. holmesii 

(B) STRAIN: ATCC 35242 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118 






cggcgctatc ctcgtggtgg ctgctactga cggtccgatg gctcagacca 50 

aggaacacat cctgttggct aagcaggttg gcgtgccctc catcctggtc 100 

gctctgaaca agtgcgattc ttccgatgtg gacgaagaca tgctcgaaat 150 

cgtcgaggac gaaatccgcg atgacctgga gaagcagggc ttcgatcgtg 200 

actgcccgat tatccacgtt tccgctctga aggccctgga aggcgacccc 250

gagtggacca agaagattga agagctcatg gaagcggtcg atacctacat 300 

tcctgagcct gttcgtgacc tcgacaagcc gttcttgatg cctatcgaag 350 

acgtcttcac cattactggt cgcggtaccg tagtgaccgg tcgtgtggaa 400

cgcggcaagc taccgttgaa cgccgaagtg gaaatcgtag gtattcgtcc 450 

tacgcaaaag accaccgtta ccggtatcga aatgttccac aagtccatgg 500

acgaagccta cgccggcgag aactgtggtc tgttgctgcg tggcaccaag 550

cgtgaggacg ttgagcgcgg tcaggttgtc tgcattcctg gctccgtgac 600 

cccgcacacc aagttcgagg gcaaggtcta catcttgaag aaggacgaag 650 

gtggacgtca caagtcgttc tacgacggct accgcccgca gttcttcttc 700 

cgcaccaccg acgtgaccgg tgttattcac ctgcccgaag gcaccgaaat 750

ggttatgcct ggcgacacca ccgaaattag cgttgagctg attcagccta 800 

tcgctatgga" ggje^ggtctcT ggcttcgcta"t " 




2) INFORMATION FOR SEQ ID NO: 119 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moellerella wisconsensis 

(B) STRAIN: ATCC 35017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119

GGTGCAATTC TGGTTGTTGC TGCAACTGAT GGCCCTATGC CACAGACTCG 50

TGAGCACATC CTGTTAGGTC GTCAGGTTGG CGTTCCATAC ATCATCGTTT 100

TCCTGAACAA ATGTGACATG GTAGACGACG AAGAGCTGTT AGAACTGGTT 150

GAAATGGAAG TCCGTGAGCT GCTGTCTCAG TACGATTTCC CAGGCGATGA 200

CACTCCAGTA ATCCGTGGTT CAGCGCTGAA AGCTCTGGAA GGCGAAGCTG 250

AGTGGGAAGC TAAAATCATT GAACTGGCAG AAGCACTGGA TTCTTATATC 300

CCAGAGCCAG AGCGTGACAT TGATAAGCCA TTCCTGTTAC CAATCGAAGA 350

CGTATTCTCA ATTTCAGGCC GTGGTACAGT TGTTACTGGT CGTGTTGAGC 400

GTGGTATCGT TAAAGTCGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAT 450

ACCGTGAAAA CAACATGTAC TGGCGTTGAA ATGTTCCGTA AACTGCTGGA 500

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTACTAAAC 550

GTGATGATAT CGAACGTGGT CAAGTATTGG CTAAACCAGG TTCAATCACT 600

CCGCATACAA CTTTCGAATC AGAAGTTTAC ATCCTGAGCA AAGATGAAGG 650

TGGCCGTCAT ACTCCATTCT TCAAAGGTTA CCGTCCACAG TTCTACTTCC 700

GTACAACTGA CGTAACCGGT ACTATCGAAC TGCCAGAAGG CGTTGAGATG 750

GTAATGCCAG GTGATAACAT CAAAATGATC GTTACTCTGA TCCACCCAAT 800

TGCAATGGAT GCAGGTCTGC GTTTT 825






2) INFORMATION FOR SEQ ID NO: 120 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 827 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE:

(A) ORGANISM: Branhamella catarrhalis

(B) STRAIN: ATCC 43628 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120 



TGGTGCTATC TTGGTTGTTT CTGCAACTGA TGGTCCTATG CCACAAACTC 50 

GTGAGCATAT CCTACTATCT CGTCAGGTTG GTGTACCATA CATCATGGTA 100 

TTCATGAACA AGTGCGATAT GGTTGATGAT GAAGAGCTAC TAGAATTGGT 150

TGAAATGGAA GTTCGTGAAC TTCTATCTGA CTATGATTTC CCTGGTGATG 200 

ATAC C CCAAT ItATO^GGT" ^CAGCACTAG /^GCaYi^AA T(JgYi^1 : GA 7 I : 2 5 0 

GGTAAATATG GCGAGCCTGC AGTTCTAGAA CTGCTAGACA CACTAGACAG 300

CTATATCCCA GAGCCTGAGC GTGATATCGA TAAGTCATTC TTGATGCCAA 350 

TTGAAGATGT CTTCTCGATC TCAGGTCGTG GTACAGTTGT GACTGGTCGT 400

GTTGAATCAG GTATTATTAA AGTTGGTGAT GAAATTGAAA TCATCGGTAT 450 

CAAACCAACT GCTAAAACCA CCTGTACTGG TGTTGAAATG TTCCGTAAAC 500 

TGTTAGACGA AGGTCGTGCA GGTGAGAACT GTGGTATCTT GTTGCGTGGT 550 

ACTAAGCGTG AAGAAGTTCA ACGCGGTCAA GTACTTGCAA AACCAGGTTC 600 

AATCACCCCA CATACTAAGT TTGATGCTGA AGTTTATGTA CTGTCAAAAG 650

AAGAAGGTGG TCGTCACACC CCATTCTTAA ATGGCTATCG CCCACAGTTC 700 

TACTTCCGTA CCACAGATGT GACTGGTGCC ATCACTCTAC AAGAAGGTAC 750 

CGAAATGGTT ATGCCTGGTG ACAATGTTGA GATGAGTGTT GAGCTTATCC 800 

ACCCAATCGC CAGGATAAAG GTCTACG 827 




2) INFORMATION FOR SEQ ID NO: 121 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Morganella morganii subsp. morganii 

(B) STRAIN: ATCC 25830 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121 

CGGCGCTATC CTGGTTGTTG CTGCAACTGA TGGCCCTATG CCACAGACCC 50 

GTGAGCACAT CCTGTTAGGT CGTCAGGTTG GCGTTCCTTA CATCATCGTA 100 

TTCCTGAACA AATGTGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCTGGCGACG 200

ACACGCCAAT CGTTCGCGGT TCAGCGCTGA AAGCACTGGA AGGCGAGCCA 250 

GAGTGGGAAG CTAARATCGT TGAACTGGCA GGTTTCCTGG ATTCTTACAT 300 

CCCTGAGCCA GAGCGTGCAA TTGACAAGCC GTTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC AATCTCCGGC CGTGGTACCG TTGTTACCGG TCGTGTTGAG 400 

CGCGGTATCA TCAAGGTTGG TGAGGAAGTT GAAATCGTGG GTATCAAAGA 450 

TACTGCGAAA ACCACCTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCMGGTGAG AACGTCGGTG TTCTGCTGCG TGGTACCAAG 550 

CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCTAAACCAG GTTCAATCAA 600 

ACCACAYACC AAATTTGAAT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGYT ACCGTCCACA GTTCTACTTC 700 

CGTACCACAG ACGTAACAGG TACTATCGAA CTGCCGGAAG GCGTTGAAAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGAT CGTCACCCTG ATCCACCCAA 800 

TCGCAA 806 



2) INFORMATION FOR SEQ ID NO: 122 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: TB 299 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 



GGTGCGATCC TGGTGGTCGC CGCCACCGAC GGCCCGATGC CCCAGACCCG 50

CGAGCACGTT CTGCTGGCGC GTCAAGTGGG TGTGCCCTAC ATCCTGGTAG 100

CGCTGAACAA GGCCGACGCA GTGGACGACG AGGAGCTGCT CGAACTCGTC 150

GAGATGGAGG TCCGCGAGCT GCTGGCTGCC CAGGAATTCG ACGAGGACGC 200

CCCGGTTGTG CGGGTCTCGG CGCTCAAGGC GCTCGAGGGT GACGCGAAGT 250

GGGTTGCCTC TGTCGAGGAA CTGATGAACG CGGTCGACGA GTCGATTCCG 300

GACCCGGTCC GCGAGACCGA CAAGCCGTTC CTGATGCCGG TCGAGGACGT 350

CTTCACCATT ACCGGCCGCG GAACCGTGGT CACCGGACGT GTGGAGCGCG 400

GCGTGATCAA CGTGAACGAG GAAGTTGAGA TCGTCGGCAT TCGCCCATCG 450

ACCACCAAGA CCACCGTCAC CGGTGTGGAG ATGTTCCGCA AGCTGCTCGA 500

CCAGGGCCAG GCGGGCGACA ACGTTGGTTT GCTGCTGCGG GGCGTCAAGC 550

GCGAGGACGT CGAGCGTGGC CAGGTTGTCA CCAAGCCCGG CACCACCACG 600

CCGCACACCG AGTTCGAAGG CCAGGTCTAC ATCCTGTCCA AGGACGAGGG 650

CGGCCGGCAC ACGCCGTTCT TCAACAACTA CCGTCCGCAG TTCTACTTCC 700

GCACCACCGA CGTGACCGGT GTGGTGACAC TGCCGGAGGG CACCGAGATG 750

GTGATGCCCG GTGACAACAC CAACATCTCG GTGAAGTTGA TCCAGCCCGT 800

CGCCATGGAC GAAGGTCTGC GTTTC 825



2) INFORMATION FOR SEQ ID NO: 123

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cinerea 

(B) STRAIN: ATCC 14685 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 

CGGTGCGATC TTGGTATGTT CCGCAGCTGA CGGTCCTATG CCGCAAACTC 50 

GCGAACACAT CCTGTTGGCC CGCCAAGTAG GTGTACCTTA CATCATCGTG 100 

TTCATGAACA AATGCGACAT GGTTGACGAT GCCGAGCTGT TGGAGCTGGT 150 

TGAAATGGAA ATCCGTGACT TGCTGTCAAG CTACGACTTC CCAGGTGACG 200 

ACTGCCCGAT CGTACAAGGT TCTGCACTGA AAGCCTTGGA AGGCGACGCA 250 

GCTTACGAAG AAAAAATCTT CGAATTGGCT GCTGCATTGG ACAGCTACAT 300 

CCCAACACCT GAGCGTGCAG TGGACAAACC TTTCTTGTTG CCTATCGAAG 350 

ACGTATTCTC TATTTCCGGT CGCGGTACAG TAGTAACCGG TCGTGTAGAG 400

CGCGGTATCA TCCACGTTGG TGACGAGATC GAAATCGTAG GTCTGAAAGA 450 

AACTCAAAAA ACCACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGTCA AGCTGGTGAC AACGTAGGTG TATTGCTGCG TGGTACTAAA 550 

CGTGAAGACG TAGAGCGTGG TCAAGTATTG GCTAAACCGG GTACTATCAC 600 

TCCTCACACC AAGTTCAAAG CAGAAGTATA CGTACTGAGC AAAGAAGAGG 650 

GTGGTCGTCA CACTCCGTTC TTCGCTAACT ACCGTCCACA ATTCTACTTC 700 

CGTACTACCG ACGTAACCGG CGCGGTTACT TTGGAAGAAG GTGTAGAAAT 750 

GGTAATGCCG GGTGAGAACG TAACCATTAC TGTAGAACTG ATTGCGCCTA 800 

TCGCTA 806



2) INFORMATION FOR SEQ ID NO: 124 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria elongata subsp. elongata 

(B) STRAIN: ATCC 25295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124 

CGGCGCAATC TTGGTATGTT CCGCTGCYGA CGGTCCTATG CCGCAAACTC 50

GCGAACACAT CCTGTTGGCC CGCCAAGTAG GCGTACCTTA CATCATCGTG 100

TTCATGAATA AATGCGACAT GGTTGAYGAT GCCGAACTGC TGGAACTGGT 150

TGAAATGGAA ATCCGTGACT TGCTGTCAAG CTACGACTTC CCAGGCGACG 200

ACTGCCCGAT CGTACAAGGT TCCGCACTGA AAGCCTTGGA AGGCGACGCA 250

GCTTACGAAG AAAAAATCTT CGAACTGGCT GCTGCATTGG ACAGCTACAT 300

CCCGACACCT GAGCGTGCCG TGGACAAACC GTTCCTGTTG CCTATCGAAG 350

ACGTATTCTC TATCTCCGGC CGTGGTACAG TAGTAACCGG TCGTGTAGAG 400

CGCGGTATCA TCCACGTCGG TGACGAGATC GAAATCGTAG GTCTGAAAGA 450

AACCCAAAAA ACCACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500

ACGAAGGTCA AGCAGGTGAC AACGTAGGCG TATTGCTGCG CGGTACCAAA 550

CGTGAAGAAG TGGAACGCGG TCAAGTATTG GCTAAACCGG GTACCATCAC 600

TCCTCACACC AAATTCAAAG CAGAAGTTTA CGTATTGAGC AAAGAAGAGG 650

GTGGTCGTCA TACTCCGTTC TTCGCTAACT ACCGTCCACA ATTCTACTTC 700

CGTACTACCG ACGTAACCGG TGCGGTTACT TTGGAAGAAG GTGTAGAAAT 750

GGTTATGCCT GGTGAGAACG TGGCCATCAC TGTAGAACTG ATTGCACCTA 800

TCGCTATGGA AGAAGGTCTG CG 822






2) INFORMATION FOR SEQ ID NO: 125 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 820 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria flavescens 

(B) STRAIN: ATCC 13120 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125 



CGGCGCGACT TGGTATGTTC CGCAGCTGAC GGTCCTATGC CGCAAACCCG 50 

CGAACACATC CTGTTGGCTC GCCAAGTAGG TGTACCTTAC ATCATCGTAT 100 

TCATGAACAA ATGCGACATG GTAGACGATG CCGAGCTGTT GGAACTGGTT 150

GAAATGGAAA TTCGTGACTT GTTGTCAAGC TACGACTTCC CAGGCGACGA 200

CTGCCCAATC" GTACAAGGtTT:TG^ AGC1"1~1^AA Wu^TciClxT " Z 5U 

CTTACGAAGA AAAAATCTTC GAATTGGCTG CTGCCTTGGA CAGCTACATC 300

CCAACACCTG AGCGTGCTGT GGACAAACCT TTCTTGTTGC CTATCGAAGA 350 

CGTATTCTCT ATCTCTGGTC GTGGTACAGT AGTAACCGGT CGCGTAGAGC 400

GCGGTATCAT CCACGTTGGT GACGAGATCG AAATCGTAGG TCTGAAAGAA 450 

ACTCAAAAAA CCACTTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGTCAA GCAGGTGACA ACGTAGGCGT ATTGCTGCGT GGTACTAAAC 550 

GTGAAGACGT AGAGCGTGGT CAAGTATTGG CTAAACCAGG TACCATCACT 600 

CCTCACACCA AATTCAAAGC AGAAGTATAC GTACTGAGCA AAGAAGAGGG 650

TGGTCGTCAC ACTCCATTTT TCGCTAACTA CCGTCCACAA TTCTACTTCC 700 

GTACTACCGA CGTAACTGGT GCAGTTACTT TGGAAGAAGG CGTAGAAATG 750 

GTAATGCCAG GTGAGAACGT AACCATTACT GTAGAACTGA TTGCGCCAAT 800 

CGCTATGGAA GAAGTCTGCG 820 




2) INFORMATION FOR SEQ ID NO: 126 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(B) STRAIN: ATCC 49226 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126 

GGTGCAATCC TGGTATGTTC TGCTGCCGAC GGCCCTATGC CGCAAACCCG 50 

CGAACACATC CTGCTGGCCC GTCAAGTAGG CGTACCTTAC ATCATCGTGT 100 

TCATGAACAA ATGCGACATG GTCGACGATG CCGAGCTGTT GGAACTGGTT 150 

GAAATGGAAA TCCGCGACCT GCTGTCCAGC TACGACTTCC CCGGCGACGA 200 

CTGCCCGATC GTACAAGGTT CCGCACTGAA AGCCTTGGAA GGCGATGCCG 250 

CTTACGAAGA AAAAATCTTC GAACTGGCTA CCGCATTGGA CAGCTACATC 300

CCGACTCCCG AGCGTGCCGT GGACAAACCA TTCCTGCTGC CTATCGAAGA 350

CGTGTTCTCC ATTTCCGGCC GCGGTACCGT AGTCACCGGC CGTGTAGAGC 400

GAGGTATCAT CCACGTTGGT GACGAGATTG AAATCGTCGG TCTGAAAGAA 450 

ACCCAAAAAA CCACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGTCAG GCGGGCGACA ACGTAGGCGT ATTGCTGCGC GGTACCAAAC 550 

GTGAAGACGT AGAACGCGGT CAGGTATTGG CCAAACCGGG TACTATCACT 600 

CCTCACACCA AGTTCAAAGC AGAAGTGTAC GTATTGAGCA AAGAAGAGGG 650 

CGGCCGCCAT ACCCCGTTTT TCGCCAACTA CCGTCCCCAA TTCTACTTCC 700 

GTACCACTGA CGTAACCGGC GCGGTTACTT TGGAAAAAGG TGTGGAAATG 750 

GTAATGCCGG GTGAGAACGT AACCATTACT GTAGAACTGA TTGCGCCTAT 800 

CGCTATGGAA GAAGGTCTGC GCTTTGCGAT 830 



2) INFORMATION FOR SEQ ID NO: 127 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria lactamica 

(B) STRAIN: ATCC 23970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127 



CGGCGCAATC TTGGTATGTT CCGCCGCCGk CGGCCCTATG CCGCAAACCC 50

GCGAACACAT TCTGTTGGCC CGCCAAGTAG GTGTACCTTA CATCATCGTA 100

TTCATGAACA AATGCGATAT GGTCGACGAT GCCGAGCTGT TGGAACTGGT 150

TGAAATGGAA ATCCGCGACC TGCTGTCAAG CTACGACTTC CCAGGCGACG 200

ACTGCCCAAT CGTACAAGGT TCCGCACTGA AAGCTTTGGA AGGCGATGCC 250

GCTTACGAAG AAAAAATCTT CGAACTGGCT GCCGCATTGG ACAGCTACAT 300

CCCGACTCCC GAGCGTGCCG TGGACAAACC GTTCCTGCTG CCTATCGAAG 350

ACGTATTCTC CATCTCCGGC CGCGGTACGG TAGTAACCGG CCGTGTAGAG 400

CGCGGTGTCA TCCACGTTGG CGACGAGATC GAAATCGTCG GTCTGAAAGA 450

AACCCAAAAA ACCACCTGTA CCGGTGTCGA GATGTTCCGC AAACTGCTGG 500

ACGAAGGTCA GGCAGGCGAC AACGTAGGCG TATTGCTGCG CGGTACCAAA 550

CGTGAAGAAG TGGAACGCGG TCAGGTATTA GCCAAACCGG GTACCATCAC 600

TCCGCACACC AAGTTCAAAG CAGAAGTGTA TGTATTGAGC AAAGAAGAGG 650

GCGGTCGTCA CACTCCGTTC TTCGCCAACT ACCGTCCGCA ATTCTACTTC 700

CGTACCACCG ACGTAACCGG CGCGGTTACT TTGGAAGAAG GCGTGGAAAT 750

GGTAATGCCC GGTGAGAACG TAACCATTAC TGTAGAACTG ATTGCGCCTA 800

TCGCTATGGA AGAAGG 816



2) INFORMATION FOR SEQ ID NO: 128 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: ATCC 13077 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128 

CGGTGCAATC CTGGTATGTT CCGCAGCCGA CGGTCCTATG CCGCAAACCC 50 

GCGAACACAT CCTGCTGGCC CGTCAAGTAG GCGTACCTTA CATCATCGTG 100 

TTCATGAACA AATGCGACAT GGTCGACGAT GCCGAGCTGT TGGAACTGGT 150 

TGAAATGGAA ATCCGCGACC TGCTGTCCAG CTACGACTTC CCCGGCGACG 200

ACTGCCCGAT CGTACAAGGT TCCGCACTGA AAGCCTTGGA AGGCGATGCC 250

GCTTACGAAG AAAAAATCTT CGAATTGGCT GCTGCATTGG ACAGCTACAT 300

CCCGACTCCC GAGCGTGCCG TGGACAAACC TTTCTTGTTG CCTATCGAAG 350

ACGTATTCTC TATTTCCGGT CGTGGTACAG TAGTAACCGG TCGTGTAGAG 400

CGCGGTATCA TCCACGTCGG TGACGAGATC GAAATCGTCG GTCTGAAAGA 450 

AACTCAAAAA ACCACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGTCA AGCAGGCGAC AACGTAGGCG TATTGCTGCG CGGTACCAAA 550 

CGTGAAGACG TAGAGCGTGG TCAAGTATTG GCTAAACCGG GTACAATCAC 600 

TCCTCACACC AAGTTCAAAG CAGAAGTATA CGTACTGAGC AAAGAAGAGG 650 

GCGGCCGCCA TACCCCGTTC TTCGCCAACT ACCGTCCCCA ATTCTACTTC 700 

CGTACCACCG ACGTAACCGG CGCGGTTACT TTGGAAGAAG GTGTGGAAAT 750 

GGTAATGCCG GGCGAGAACG TAACCATCAC CGTAGAACTG ATTGCGCCTA 800 

TCGC TATGGA~ AGAAGG TTU'b" GGUIXI^GCGaT T ~ a J 1 



2) INFORMATION FOR SEQ ID NO: 129 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria mucosa 

(B) STRAIN: ATCC 19696 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129

CGGCGCAATC TTGGTATGTT CTGCTGCYGA CGGTCCTATG CCGCAAACCC 50

GYGAACACAT CCTGTTGGCC CGTCAAGTAG GYGTACCTTA CATCATCGTG 100

TTCATGAACA AATGCGACAT GGTTGACGAT GCCGAAYTGT TGGAACTGGT 150

TGAAATGGAA ATCCGTGACT TGCTGTCAAG CTACGACTTC CCTGGYGACG 200

ACTGCCCGAT TGTACAAGGT TCTGCACTGA AAGCCTTGGA AGGCGATGCC 250

GCTTACGAAG AAAAAATCTT CGAACTGGCT GCCGCATTGG ACAGCTACAT 300

CCCGACTCCC GAGCGTGCCG TAGACAAACC GTTCCTGTTG CCTATCGAAG 350

ACGTATTCTC CATCTCCGGT CGTGGTACAG TAGTAACCGG CCGTGTAGAG 400

CGCGGTGTTA TCCACGTTGG TGACGAGATC GAAATCGTAG GTCTGAAAGA 450

AACCCAAAAA ACCACATGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500

ACGAAGGTCA AGCCGGTGAC AACGTAGGCG TATTGCTGCG CGGTACCAAA 550

CGTGAAGAAG TGGAACGCGG TCAAGTATTG GCTAAACCGG GTACCATCAC 600

TCCGCACACC AAATTCAAAG CAGAAGTGTA CGTATTGAGC AAAGAAGAGG 650

GTGGTCGTCA TACTCCGTTC TTCGCTAACT ACCGTCCTCA ATTCTACTTC 700

CGTACTACCG ACGTAACCGG TGCGGTTACT TTGGAAGAAG GTGTAGAAAT 750

GGTTATGCCT GGTGAGAAYG TAGCCATYAC TGTAGAACTG ATTGCGCCTA 800

TYGCTATGGA AGAAG 815






2) INFORMATION FOR SEQ ID NO: 130 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria sicca 

(B) STRAIN: ATCC 9913 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130 



GGCGCAATCT TGGTATGTTC CGCTGCTGAC GGTCCTATGC CGCAAACCCG 50 

CGAACACATC CTGTTGGCCC GCCAAGTAGG CGTACCTTAC ATCATCGTGT 100 

TCATGAACAA ATGCGACATG GTTGACGATG CCGAGCTGTT GGAACTGGTT 150

GAAATGGAAA TCCGTGACTT GCTGTCAAGC TACGACTTCC CTGGTGACGA 200 

CTGCCCGATC GTACAAGGTT CTGCACTGAA AGCCTTOTAA GGCGACGCCG 250

CTTACGAAGA AAAAATCTTC GAACTGGCTG CTGCATTGGA CAGCTACATC 300 

CCGACTCCTG AGCGTGCCGT GGACAAACCG TTCCTGTTGC CTATTGAAGA 350

CGTATTCTCC ATCTCCGGTC GCGGTACCGT AGTAACCGGC CGTGTAGAGC 400

GCGGTGTTAT CCACGTTGGT GACGAGATTG AAATCGTAGG TCTGAAAGAA 450

ACCCAAAAAA CCACTTGTAC CGGTGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGTCAA GCCGGTGACA ACGTAGGCGT ATTGCTGCGC GGTACCAAAC 550 

GTGAAGAAGT GGAACGCGGT CAAGTATTGG CTAAACCGGG TACCATCACT 600 

CCTCACACTA AATTCAAAGC AGAAGTTTAC GTATTGAGTA AAGAAGAGGG 650

TGGTCGTCAT ACTCCGTTCT TCGCTAACTA CCGTCCTCAA TTCTACTTCC 700 

GTACTACCGA CGTAACCGGC GCGGTTACTT TGGAAGAAGG TGTAGAAATG 750 

GTTATGCCTG GTGAGAACGT AGCCATCACT GTAGAACTGA TTGCACCGAT 800

CGCTATGGAA GAAGGTCTGC GCTTTGCGA 829 




2) INFORMATION FOR SEQ ID NO: 131 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 814 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria subflava 

(B) STRAIN: ATCC 14221 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131 

CGGCGCGACT TGGTATGTTC CGCAGCTGAT GGTCCTATGC CTCAAACTCG 50 

CGAACACATC CTGTTGGCTC GCCAAGTAGG TGTACCTTAC ATCATCGTAT 100 

TCATGAACAA ATGCGACATG GTTGACGATG CCGAGCTGTT GGAACTGGTT 150 

GAAATGGAAA TCCGTGACCT GTTGTCAAGC TACGACTTCC CAGGCGACGA 200 

CTGCCCAATC GTACAAGGTT CTGCACTGAA AGCTTTGGAA GGTGACGCTG 250 

GTTACGAAGA GAAAATCTTC GAATTGGCTG CTGCTCTGGA CAGCTACATC 300

CCAACACCTG AGCGTGCTGT GGACAAACCT TTCTTGTTGC CTATCGAAGA 350 

CGTATTCTCT ATCTCTGGCC GTGGTACAGT AGTAACTGGT CGTGTAGAGC 400

GCGGTATCAT CCACGTTGGT GACGAGATCG AAATCGTAGG TCTGAAAGAA 450

ACCCAAAAAA CCACTTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGTCAA GCTGGTGACA ACGTAGGCGT ATTGCTGCGT GGTACCAAAC 550 

GTGAAGACGT AGAGCGTGGT CAAGTATTGG CTAAACCAGG TACCATTACT 600 

CCTCACACCA AATTCAAAGC AGAAGTATAC GTACTGAGCA AAGAAGAGGG 650 

TGGTCGTCAC ACTCCATTCT TCGCTAACTA CCGTCCACAA TTCTACTTCC 700 

GTACTACTGA CGTAACTGGT GCAGTTACTT TGGAAGAAGG CGTAGAAATG 750 

GTAATGCCAG GTGAGAACGT AACCATTACT GTAGAACTGA TTGCGCCTAT 800 

CGCTATGGAA GAAG 814 



2) INFORMATION FOR SEQ ID NO: 132 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria weaveri 

(B) STRAIN: ATCC 51223 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132 



GCCATCTTGG TATGTTCTGC TGCTGACGGT CCTATGCCGC AAACCCGTGA 50

GCACATCCTG TTGGCTCGTC AAGTAGGTGT ACCCTACATC ATCGTATTCA 100

TGAACAAATG CGATATGGTT GATGATGCAG AGCTGCTGGA ATTGGTAGAA 150

ATGGAAATCC GTGATCTGCT GAGCAGCTAC GATTTCCCTG GCGATGATTG 200

YCCAATCGTG CAAGGTTCTG CTTTGAAAGC TTTGGAAGGT GATGCCGCTT 250

ACGAAGAAAA AATCTTTGAA TTAGCTGCTG CATTGGATAG CTATATTCCA 300

ACWCCTGAGC GYGCTGTTGA TAAACCATTC CTGTTGCCGA TTGAAGATGT 350

ATTCTCAATT TCAGGTCGTG GTACAGTAGT AACTGGTCGT GTAGAGCGCG 400

GTATTATTCA CGTAGGCGAT GAAATTGAAA TTGTAGGTTT GAAAGARACY 450

CAAAAAACTA CTTGTACCGG CGTTGAAATG TTCCGTAAAT TGCTGGATSA 500

AGGTCAGGCT GGTGATAACG TAGGCGTATT GTTGCGTGGT ACCAAACGTG 550

AAGACGTTGA GCGTGGTCAA GTATTGGCTA AGCCTGGTWC TATTACTCCG 600

CAYACCAAAT TCAAAGCAGA RGTKTATGTW TTGAGYAAGG AAGAAGGCGG 650

TCGTCATACT CCGTTCTTCG CTAACTATCG TCCGCAATTC TATTTCCGTA 700

CTACAGACGT TACCGGTGCK GTRACTTTAG AAGAAGGTGT GGAAATGGTA 750

ATGCCTGGTG AGAAYGTTGC CATTACTGTW GARYTGATYG CTCCGATTGC 800

KATGGAAGAA GGYTGCGT 818



2) INFORMATION FOR SEQ ID NO: 133

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 836 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ochrobactrum anthropi

(B) STRAIN: ATCC 49188 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133 

CGGCGCAATT CTGGTTGTTT CGGCCGCTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTCGCT CGTCAGGTTG GCGTTCCGGC AATCGTCGTG 100 

TTCCTGAACA AGTGCGACCA GGTTGACGAT GCAGAACTGC TCGAACTGGT 150 

TGAACTGGAA GTTCGCGAAC TTCTGTCGAA ATACGATTTC CCGGGCGACG 200 

AAGTTCCGAT CATCAAGGGC TCGGCTCTTG CTGCTCTGGA AGATTCTTCG 250 

AAGGAACTGG GCGAAGACGC CGTTCGTTCG CTGATGGCCG CTGTTGACGA 300

CTACATTCCG ACCCCGGAAC GTCCGATCGA CCAGCCGTTC CTGATGCCGA 350 

TCGAAGACGT TTTCTCGATC TCGGGCCGTG GTACGGTTGT GACGGGTCGC 400 

GTTGAGCGCG GTATCGTCAA GGTTGGTGAA GAAGTTGAAA TCGTCGGCAT 450 

CAAGGCGACG GCGAAGACGA CGGTAACCGG CGTTGAAATG TTCCGCAAGC 500 

TGCTCGAYCA GGGCCAGGCT GGCGACAACA TCGGCGCTCT GATCCGCGGC 550 

GTTGGCCGTG AAGACGTTGA ACGCGGCCAG GTTCTCTGCA AGCCGGGTTC 600 

TGTGAAGCCG CACACCAAGT TCAAGGCAGA AGCCTACATT CTGACCAAGG 650 

ACGAAGGTGG CCGTCATACG CCGTTCTTTA CGAACTACCG TCCGCAGTTC 700 

TACTTCCGCA CGACGGACGT GACCGGTGTT GTCACGCTGC CGGAAGGCAC 750 

GGAAATGGTT ATGCCTGGCG ACAACGTCGC TATGGACGTC ACCCTGATCG 800 

TGCCGATCGC ' CATC^AAGAG AAGCTCl:GCT~^C^CTA~ " d J fa 



2) INFORMATION FOR SEQ ID NO: 134 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea agglomerans 

(B) STRAIN: ATCC 27155 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134

CCTGGTTGTT GCTGCGACTG ATGGCCCAAT GCCACAGACC CGTGAGCACA 50

TCCTGCTGGG TCGTCAGGTT GGCGTTCCTT ACATCATCGT GTTCCTGAAC 100 

AAGTGTGACA TGGTTGATGA TGAAGAGCTG CTGGAACTGG TAGAGATGGA 150 

AGTACGTGAC CTGCTGTCAC AGTACGACTT CCCAGGCGAT GACACCCCGA 200

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGTTCC TGAGTGGGAA 250 

GCAAAAATCG TTGAGCTGGC TGAACACCTG GACAACTACA TCCCGGATCC 300 

AGTCCGTGCG ATCGACATGC CGTTCCTGCT GCCAATCGAA GACGTATTCT 350 

CAATCTCTGG CCGTGGTACC GTTGTTACCG GTCGTGTTGA GCGCGGCATC 400 

GTTAAAGTCG GCGACGAAGT TGAAATCGTG GGTATCAAAG ATACTGCGAA 450

ATCAACCTGT ACCGGTGTTG AGATGTTCCG TAAGCTGCTG GACCAGGGTC 500 

AGGCAGGCGA AAACTGTGGT GTTCTGCTGC GCGGTATCAA GCGTGAAGAC 550 

ATCCAGCGTG GCCAGGTTCT GGCTAAGCCA GGCTCAATCA AGCCGCACAC 600 

CCAGTTCGAG TCAGAAGTTT ACGTTCTGTC TAAAGACGAA GGTGGCCGCC 650 

ATACTCCGTT CTTCAAAGGC TATCGTCCAC AGTTCTACTT CCGTACAACT 700

GATGTAACCG GTTCAGTAGA GCTGCCAGAA GGCGTTGAGA TGGTCATGCC 750 

AGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCA ATCGCAATGG 800 

ACGAA 805






2) INFORMATION FOR SEQ ID NO: 135 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea dispersa

(B) STRAIN: ATCC 14589 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135



CGCGATCCTG GTTGTTGCTG CGACTGATGG CCCAATGCCT CAGACCCGTG 50 

AGCACATCCT GCTGGGCCGT CAGGTTGGCG TTCCTTACAT CATCGTGTTC 100 

CTGAACAAGT GTGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTTGA 150

GATGGAAGTT CGCGATCTGC TGTCTCAGTA CGACTTCCCA GGCGACGATA 200 

C C~C C AA T CG T~ ACGCGGTTC 7 !'" ~G CG CTGA^G~ CGLTC^/^GG™ UGaUgUi^aI/ ~ Z bV " 

TGGGAAGCGA AAGTCGTTGA GCTGGCTGGT CACCTGGATA CTTACATTCC 300 

AGATCCAGTA CGTGCTATCG ATCTGCCGTT CCTGCTGCCA ATCGAAGACG 350

TATTCTCAAT CTCTGGCCGT GGTACCGTTG TTACCGGTCG TGTTGAGCGC 400

GGCATCGTGA AAGTGGGCGA CGAAGTAGAA ATCGTTGGTA TCAAAGCGAC 450 

TGCCAAGTCT ACCTGTACCG GTGTTGAAAT GTTCCGCAAA CTGCTGGACC 500 

AGGGTCAGGC AGGCGAGAAC TGTGGTGTTC TGCTGCGCGG TATCAAGCGT 550 

GAAGAGATCC AGCGTGGTCA GGTTCTGGCT AAGCCAGGCA CCATCAAGCC 600 

ACACACCAAG TTCGTATCAG AAGTGTACGT ACTGTCTAAA GACGAAGGCG 650

GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCACAGTT CTACTTCCGT 700 

ACYACTGATG TGACCGGCAM CATMGAACTG CCAGAAGGCG TTGAGATGGT 750 

AATGCCAGGC GACAACATCA AAATGRCCGT TGAGCTGATC CACCCAATCG 800 

CGATGGACCA GGGTCTGCGT TTCGC 825 


2) INFORMATION FOR SEQ ID NO: 136 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 762 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pasteurella multocida

(B) STRAIN: NCTC 10322

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136 

CACAAACACG TGAGCACATC CTTTTAGGTC GCCAAGTAGG CGTTCCTTAC 50 

ATCATCGTAT TCTTAAACAA ATGCGACATG GTGGATGATG AAGAATTATT 100

AGAATTAGTT GAAATGGAAG TGCGTGAACT TCTTTCTCAA TATGATTTCC 150 

CAGGTGATGA TACACCAATC GTACGTGGTT CAGCGTTACA AGCGTTAAAC 200 

GGYGTAGCTG AGTGGGAAGA GAAAATTCTT GAGTTAGCCA ACCACTTAGA 250 

TACTTACATT CCAGAGCCAC AACGTGCAAT CGACCAACCG TTCCTTCTTC 300 

CGATTGAAGA CGTGTTCTCA ATTTCTGGTC GTGGTACAGT AGTAACAGGT 350

CGTGTTGAGC GTGGTATCAT CCGTACAGGT GAAGAGGTTG AAATTGTTGG 400

TATTAAAGCG ACAACGAAGA CCACAGTAAC AGGTGTTGAG ATGTTCCGTA 450

AATTATTAGA CGAAGGTCGT GCGGGTGAGA ACGTTGGTGC TTTATTACGT 500 

GGTACTAARC GTGAAGAAAT CGAACGTGGT CAAGTGTTAG CGAAACCGGG 550 

TTCAATYACG CCACACACTG ATTTTGAATC AGAAGTTTAC GTGTTATCAA 600

AAGAAGAAGG TGGTCGTCAT ACACCATTCT TCAAAGGTTA CCGTCCACAG 650 

TTCTACTTCC GTACAACGGA CGTAACAGGT ACAATCGAAT TACCGGAAGG 700 

TGTTGAGATG GTGATGCCTG GTGATAACAT CAAGATGACT GTAAGTTTGA 750 

TTCACCCAAT CG 762 




2) INFORMATION FOR SEQ ID NO: 137 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus anaerobius 

(B) STRAIN: ATCC 27337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 

TGGAGCTATC TTAGTTGTAT CAGCAGCGGA TGGACCAATG CCACAAACAA 50 

GAGAACACAT CTTATTATCA AGACAAGTAG GAGTACCATA TATCGTAGTA 100 

TATTTGAATA AAGCAGATAT GGTAGAAGAT GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTAAGAGAAT TACTATCTGA ATATGGATTC CCAGGAGATG 200 

AAATTCCAAT CATAACAGGA TCATCCTTAG GAGCATTAAA TGGAGAACAA 250 

AAATGGATAG ATCAAATCAT GGCATTGATG AAAGCCGTAG ATGAATATAT 300 

TCCAACACCG GAAAGAGCAG TAGATCAACC ATTCTTGATG CCAATCGAAG 350 

ACGTATTTAC AATTACAGGA AGAGGAACTG TAGTAACAGG AAGAGTTGAA 400

AGAGGAGTTG TAAAAGTWGG AGAAGAAGTT GAAATCGTAG GAATCAAAGC 450

GACAACAAAG ACAACTTGTA CYGGAGTAGA AATGTTCCGA AAATTATTGG 500 

ATCAAGGACA AGCAGGAGAT AACATCGGAG CTTTATTRAG AGGAACCAAG 550 

AAAGAAGATG TAGAAAGAGG ACAAGTATTG GCAAAACCAG GAACAATTCA 600 

TCCTCATACA AACTTCAGTG GAGAAGTATA TGTATTGACA AAAGAAGAAG 650

GAGGAAGACA TACTCCATTC TTCTCAGGAT ACAGACCACA ATTTTACTTT 700

AGAACCACAG ATATTACAGG AGCAGTAACA TTACCAGAAG GAGTAGAAAT 750

GGTAATGCCR GGAGATAATA TCACAATGAC AGTAGAATTG ATTCACCCAA 800

TTGCAATGGA AACAGGATTA CGATTTGCAA TT 832



2) INFORMATION FOR SEQ ID NO: 138 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Peptostreptococcus asaccharolyticus

(B) STRAIN: LSPQ 2639 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138 

TAGTATGTTC AGCAGCAGAY GGTCCAATGC CACAAACAAG AGAACACATT 50

CTACTAGCAA GACAAGTTGG TGTACCAAAG ATAGTAGTAT TCCTAAACAA 100 

AGAAGACCAA GTAGACGATC CAGAACTAAT TGAATTAGTA GAGATGGAAA 150 

TCAGAGACCT ACTATCAGAA TATGACTTCG AYGGAGACAA CACACCAATC 200 

GTAGTAGGAT CAGCATTAAA AGCCCTAGAC GATCCAGACG GAGAATGGGG 250 

AGACAAAATC GTAAAACTAA TGGAAGMAGT AGACGAATAC ATCCCAACAC 300

CAGTAAGAGA TACAGAACAC CCATTCCTAA TGCCAATCGA AGACRTATTC 350 

TCAATYACAG GAAGAGGAAC AGTAGCAACA GGAAGAGTAG AACAAGGTGT 400 

AGTAAAAGTA GGMGACACAG TAGAACTAGT AGGCTTAACA GACGAAAGCA 450 

GACAAGTAGT AGTAACAGGT GTAGAAATGT TTAGAAAACA ACTAGACCTA 500 

GCAGAAGCMG GAGACAACAT TGGAGCCCTA CTAAGAGGAG TACAAAGAGA 550

AGAAATCCAA AGAGGACAAG TACTAGCAGC ACCAGGAACA ATCAAACCAC 600 

ACACAAAATT TGAAGCAGAA GTATACGTAC TAACAAAAGA AGAAGGTGGA 650 

AGACACACAC CATTCTTTAA CGGATACAGA CCACAATTCT ACTTCAGAAC 700 

AACAGACGTA ACAGGAGACA TCCAACTAGC AGACGGAGTA GAAATGGTAA 750 

TGCCAGGAGA CAACTCAACA TTTACAGTAA CACTAATCAC ACCAATCGCA 800

ATGGACGAAG GACTAAGATT CGC 823 



2) INFORMATION FOR SEQ ID NO: 139

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Peptostreptococcus prevotii 

(B) STRAIN: ATCC 9321






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139

CTATCATCGT AGTATCTGCA GCAGACGGTC CAATGCCCAA ACAGAGAGAA 50

CACATCCTAC TAGCAAGACA AGTAGGCGTT CCAAAAATCG CAGTATTCCT 100

AAACAAAGAA GACCAAGTAG ACGATCCAGA ACTAATCGAA TTAGTAGAAA 150

TGGAAATCAG AGACCTACTT TCAGAATACG ACTTCGATGG AGACAACGCT 200

CCAGTAGTAG TAGGATCTGC TCTTAAATCA CTAGAAGAAG GCGGAGAAGG 250

CCCATGGTCA GACAAAATCC TTGACCTAAT GGCACAAGTA GACGAATACT 300

TCGACATCCC AGAAAGAGAC AACGACCAAC CATTCCTAAT GCCAGTAGAA 350

GACGTAATGA CAATCTCAGG ACGTGGAACA GTAGCAACAG GAAGAGTTGA 400

AAGAGGAACA CTAAAAGTTG GTGATACAGT AGAAATCGTA GGACTAACAG 450

AAGATACAAA AGAAACAGTA GTAACTGGAG TAGAAATGTT CCACAAATCM 500

CTAGACCAAG CAGAATCTGG AGATAACGTA GGACTACTAC TAAGAGGAGT 550

AACAAGAGAT CAAATCTCAA GAGGACAAGT ACTAGCAAAA CCAGGWTCAG 600

TAAACCCACA CACAGAATTC GAAGGTCAAG TATACGTACT AACAAAAGAA 650

GAAGGTGGAC GTCACACACC ATTCTTCAGT GGATATAGAC CACAATTCTT 700

CTTTAGAACA ACAGACGTAA CAGGAGACAT CGAACTAGAA GAAGGCGTAG 750

AAATGGTAAT GCCAGGAGAC AACGCAACAT TCAAAATCAC ACTCCAAAAA 800

CCAATCGCTC TAGAAGAAGG ACTAAGATTC GC 832






2) INFORMATION FOR SEQ ID NO: 140 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 831 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas asaccharolytica 

(B) STRAIN: ATCC 25260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140

CGGTGCTATC ATCGTAGTTG CTGCAACTGA TGGTCCTATG CCTCAGACGC 50 

GTGAGCACAT CCTACTAGCA CGTCAGGTCA ACGTACCTCG TCTAGTTGTC 100 

TTTATGAACA AGTGCGACCT TGTTGATGAC GAGGAGATGC TCGAGCTCGT 150 

AGAGATGGAT ATGCGTGAGC TACTAAGCTT CTATGACTTT GACGGCGACA 200

ACACTCCTGT CATCCGTGGT TCTGCTCTTG GTGCTCTCAA TGGTGAGCCT 250 

AAGTGGGTAG AGAAgWtAT ^GAGCTCATcf GAGGCTGTAG ACAlVi^AT 300

CCCACTACCT GAGCGCGACA TCGACAAGCC TTTCCTAATG CCTGTAGAGG 350 

ACGTATTCTC TATCACAGGT CGTGGTACTG TCGCTACTGG TCGTATCGAG 400 

ACTGGTGTCG TTAAGGTCAA CGATGAGGTT CAGATCATCG GTCTAGGTGC 450

TGAGGGTAAG AAGAGCGTCG TAACTGGCGT GGAAATGTTC CGCAAGATCC 500 

TTGATGAGGG TGAAGCTGGT GATAACGTAG GTCTCCTACT CCGTGGTATC 550 

GACAAGGACG AGATCAAGCG CGGTATGGTC CTAGCACACC CAGGTCAGGT 600 

CAAGCCTCAC GATCACTTCA AGGCTGAGGT CTATATCCTG AAGAAGGAAG 650 

AGGGTGGTCG TCACACACCA TTCCACAACA AGTACCGTCC TCAGTTCTAC 700

ATCCGTACGC TAGACGTAAC GGGCGAGATC ACACTCCCAG AGGGTGTAGA 750 

GATGGTTATG CCTGGTGATA ACGTCACCAT CGATGTCAAG CTCATCTCTC 800 

CAGTAGCTTG TAGCGTAGGT CTACGCTTCG C 831 






2) INFORMATION FOR SEQ ID NO: 141 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 818 bases



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Porphyromonas gingivalis 

(B) STRAIN: ATCC 33277 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141 



CGGTGCTATA ATCGTTGTAG CAGCTACAGA CGGTCCTATG CCTCAGACTC 50 

GCGAGCACAT CCTTTTGGCT CGCCAGGTAA ACGTTCCTCG TCTGGTTGTT 100 

TTCATGAACA AATGTGACAT GGTAGACGAT GAAGAGATGC TCGAGCTTGT 150

TGAAATGGAC ATGCGCGAAC TCCTTTCTTT CTACGATTTC GATGGTGACA 200 

ATACCCCTAT CATCCGTGGT TCTGCTCTGG GCGCTTTGAA TGGAGAGCCT 250 

CAGTGGGAAG ACAAGGTGAT GGAGCTTATG GAAGCTGTTG ACAACTGGGT 300

TCCCCTGCCT GAGCGCGATA TCGACAAACC GTTCTTGATG CCGGTTGAAG 350

ACGTGTTCTC TATCACGGGT CGTGGTACGG TCGCTACAGG ACGTATCGAA 400

ACCGGTATTG TGAAGACCGG TGACGAAGTT CAAATCATCG GCCTCGGTGC 450

AGAAGGAATG AAGTCGGTTG TTACGGGTGT TGAAATGTTC CGTAAGATTC 500 

TTGACGAAGG TCAGGCTGGT GACAACGTTG GTCTCCTCCT GCGTGGTATC 550 

GATAAGGATC AGATCAAGCG TGGTATGGTT ATCTCTCACC CGGGTAAGAT 600 

TACTCCTCAC AAGAGATTTA AGGCCGAGGT TTATATCTTG AAGAAAGAAG 650

AAGGTGGTCG CCACACTCCT TTCCACAACA AATATCGTCC GCAGTTCTAC 700 

ATCCGTACGC TTGACGTGAC CGGTGAAATC ACTCTTCCCG AAGGAACAGA 750 

AATGGTTATG CCCGGTGACA ACGTAACGAT CACTGTAGAA CTCATCTACC 800 

CGGTTGCATG TAATGTAG 818 




2) INFORMATION FOR SEQ ID NO: 142 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pragia fontium 
(B) STRAIN: ATCC 49100

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142 

CGGCGCTATT CTGGTTGTTG CTGCAACTGA TGGTCCTATG CCTCAAACTC 50 

GTGAGCACAT CCTGTTAGGY CGCCAGGTTG GCGTACCATA CATCATTGTG 100

TTCCTGAACA AGTGTGACAT GGTTGAYGAT GAAGAGCTGT TAGAACTGGT 150 

TGAAATGGAA GTTCGTGAGC TTCTGTCTCA GTACGATTTC CCAGGTGATG 200

ATACTCCAGT TGTTCGTGGT TCTGCGCTGA AAGCGTTRGA AGGCGAAGCT 250 

GAGTGGGAAG CTAAAATCAT TGAATTGGCT GACTCCCTGG AYAGCTACAT 300 

TCCACAGCCA GAGCGTGCAA TTGATAAGCC GTTCCTGCTG CCAATCGAAG 350

ACGTTTTCTC AATCTCTGGC CGTGGTACAG TAGTAACCGG TCGTGTAGAG 400

CGCGGTATCG TTAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450

TACTGTGAAA ACAACTTGTA CTGGCGTTGA AATGTTCCGT AARTTACTGG 500

ATGAAGGCCG TGCGGGTGAG AACGTTGGTG TTCTGCTGCG TGGTACTAAG 550 

CGTGATGAAA TCGAACGTGG TCAAGTATTA GCAAAACCAG GTTCAATCAA 600

CCCGCATACT AACTTCGTAT CAGAAGTTTA TATCCTGAGC AAAGATGAAG 650

GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTYTACTTC 700

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTAGAGAT 750

GGTAATGCCA GGTGATAACA TTCAGATGAC TGTAACTCTG ATTGCCCCAA 800

TCGCGATGGA CGAAGGTTTA CGCTTCGCTA 830



2) INFORMATION FOR SEQ ID NO: 143

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Prevotella melaninogenica

(B) STRAIN: ATCC 25845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143 

TGGTGCTATC TTGGTTGTAG CTGCTACTGA TGGTCCTATG CCTCAGACTC 50

GTGAGCACGT ATTGCTCGCT CGTCAGGTAA ACGTACCTCG CTTGGTTGTA 100 

TTCTTGAACA AGTGTGATAT GGTTGACGAT GCTGAGATGC TTGACCTCGT 150 

TGAGATGGAG GTTCGTGAGA TCCTCGAGCA GTACGGTTAT GAGGAGGATA 200

CTCCTATTAT TCGTGGTTCT GCACTCGGTG CTTTGAACGG TGTTGAGAAG 250

TGGGTAGACT CTGTAATGGA GCTCATGGAT ACTGTTGACA CTTGGATTGA 300

AGAGCCAGAG CGTGAGATTG ACAAGCCATT CTTGATGCCT GTTGAGGACG 350 

TATTCTCTAT CACAGGTCGT GGTACTGTAG CTACTGGTCG TATCGAGACT 400

GGTATCTGTA AGGTAGGTGA TGAGGTTCAG TTGCTCGGTC TCGGTGAGGA 450 

CAAGAAGTCT GTTATCACTG GTGTTGAGAT GTTCCGTAAG AACCTTCCAA 500 

CAGGTCAGGC TGGTGACAAC GTAGGTCTCC TCCTTCGTGG TATCGATAAG 550

GCTGAGGTTA AGCGTGGTAT GGTTGTTGTG CACCCAGGTG CTATTACTCC 600 

TCACGATCAC TTCAAGGCAT CTATCTATGT ATTGAAGAAG GAAGAGGGTG 650 

GTCGTCATAC TCCATTCGGT AACAAGTATC GTCCACAGTT CTACCTCCGT 700 

ACAATGGACT GTACAGGTGA AATCCACCTC CCAGAGGGCG TTGAGATGGT 750 

TATGCCAGGT GACAACGTAG AGATTGAAGT TGTATTGATC TATAAGGTTG 800

CTTTGAACGA GGGTCTTCGT T 821 



2) INFORMATION FOR SEQ ID NO: 144

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Prevotella oralis

(B) STRAIN: ATCC 33269

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144

TGGTGCTATT CTTGTAGTAG CTGCTACTGA CGGTCCTATG CCTCAAACTC 50

GTGAACACGT GCTTCTTGCT CGTCAGGTGA ACGTACCTCG TTTGGTCGTT 100 

TTCTTGAACA AGTGCGATAT GGTTGACGAT GAAGAAATGC TTGAGCTCGT 150

AGAAATGGAG CTTCATGAAC TTCTCGAGCA GTATGAATAT GAGGAGGATA 200

CTCCTATTGT TCGTGGTTCG GCACTTGGCG CTCTGAATGG AGTAGAGAAG 250

TGGGTTGACA GCGTGATGAA GTTGATGGAT ACCGTTGATG AATGGATACA 300 

GGAACCACCG CGTGATCTTG ATAAGCCTTT CTTGATGCCG GTAGAGGATG 350 

TATTTTCTAT TACTGGTCGT GGAACGGTTG TTACAGGCCG TATTGAAACT 400

GGTAAGGTTA AGGTGGGCGA TGAAGTTCAA CTTCTTGGTC TCGGTGAAGA 450

TAAGAAGTCC GTTGTGACAG GCGTTGAGAT GTTCCGTAAG ATTCTTGACG 500

AAGGTGAAGC TGGTGATAAT GTAGGCTTGC TGCTTCGTGG TATCGATAAG 550 

ACGGAAGTAA AGCGTGGTAT GGTTGTCGTA CATCCGGGGG CTATTACTCC 600 

TCACGATCAT TTCAAGGCTT CAGTTTACGT ATTGAAGAAA GAAGAAGGCG 650 

GTCGCCATAC TCCGTTTGGT AMCAAGTATC GTCCACAGTT CTATCTTCGT 700 

ACCATGGACT GTACTGGTGA AATTACTCTT CCGGAAGGAG TTGAGATGGT 750

AATGCCGGGT GATAACGTCG AAATTGAAGT TAAGTTGATC TATCCGGTAG 800 

CTTTGAACGA GGGACTTCGT TTCGCTA 827 



2) INFORMATION FOR SEQ ID NO: 145



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 833 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Propionibacterium acnes 

(B) STRAIN: ATCC 6919 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145

CGGCGCCATC CTCGTGGTTG CTGCTACCGA CGGCCCGATG CCTCAGACTC 50 

GCGAGCACGT TCTGCTCGCT CGTCAGGTGG GCGTGCCCGC CATCGTCGTC 100 

GCCCTCAACA AGTGCGACAT GGTTGACGAT GAGGAGCTCA TTGAGCTCGT 150 

CGAGATGGAG GTCCGCGAGC TGCTGACCTC GCAGGAGTTC GACGGCGACA 200

ACTGCCCTGT CGTTCGCATC TCCGCCTTCC AGGCCCTCCA GGGTGATGAG 250 

AAGTGGACCC AGTCGATCCT CGACCTCATG GATOCCGTGG ACGAGI^'aT 300

CCCGCAGCCT GAGCGCGATC TCGACAAGCC CTTCCTTATG CCGATCGAGG 350

ACGTCTTCAC CATCACCGGC CGTGGCACCG TTGTCACCGG TCGTGTCGAG 400

CGCGGCGTCG TCAAGACTGG CGAAGAGGTC GAGATCGTCG GTATCCACGA 450

GAAGACCCAG AAGACCACCG TTACCGGTGT CGAGATGTTC CGCAAGATCC 500 

TCGACGAGGG CCGCGCTGGT GAGAACGTCG GCGTTCTGCT CCGTGGCACC 550 

AAGAAGGAGG ATGTCGTTCG CGGCATGGTC CTCTCCAAGC CTGGTTCCAC 600 

CACCCCCCAC ACCGACTTCG AGGGCCAGGT CTACGTCCTC AAGAAGGATG 650 

AGGGTGGCCG CCACAAGCCG TTCTTCTCCC ACTACAGCCC CCAGTTCTAC 700

TTCCGTACCA CGGACGTGAC TGGCACTGTT GAGCTCCCCG AGGGCACCGA 750 

GATGGTCATG CCTGGCGACA ACACCGACAT GACTGTGCAC CTGATTCACC 800 

CGGTTGCCAT GGAGGATCAG CTCA^GTTCG CTA 833 






2) INFORMATION FOR SEQ ID NO: 146 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 745 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 35659 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146 

CACAAACTCG TGAGCACATC CTGTTAGGTC GTCAGGTTGG TGTTCCTTAC 50 

ATCATCGTAT TCCTGAACAA ATGTGACATG GTAGATGATG AAGAGCTGTT 100 

AGAATTAGTT GAAATGGAAG TTCGTGAACT TCTGTCTCAA TACGATTTCC 150

CAGGTGATGA CACTCCAGTA ATCCGTGGTT CAGCGCTGAA AGCACTGGAA 200 

GGCGAAGCAG AGTGGGAAGC AAAAATTGTT GAATTAGCAG AAGCACTGGA 250 

TTCTTATATC CCAGAGCCAG AGCGTGCAAT TGACAAACCA TTCCTGTTAC 300 

CAATCGAAGA TGTATTCTCA ATCTCAGGCC GTGGTACAGT AGTTACTGGT 350 

CGTGTAGAGC GTGGTATCAT CAAAGTAGGT GATGAAGTTG AGATTGTTGG 400

TATCAAAGAA ACCGCCAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGTA 450 

AATTACTTGA CGAAGGTCGT GCAGGTGAGA ACGTAGGTGT TCTGCTGCGT 500 

GGTACAAAAC GTGAAGAAAT CGAACGTGGA CAAGTACTGG CRAAACCAGG 550 

CTCAATCAAC CCACACAACA AATTTGAATC AGAAGTTTAT ATTCTGAGCA 600 

AAGATGAAGG TGGTCGTCAC ACTCCATTCT TCAAAGGCTA CCGTCCACAG 650

TTCTACTTCC GTACAACTGA CGTAACTGGT ACTATCGAAT TACCAGAAGG 700 

CGTAGAAATG GTAATGCCAG GCGACAACGT GAACATGATC GTTGA 745 




2) INFORMATION FOR SEQ ID NO: 147 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Proteus penneri

(B) STRAIN: ATCC 33519 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147

GGAGCTATCC TGGTTGTTGC TGCGACAGAT GGCCCAATGC CACAAACTCG 50
TGAGCACATC CTGTTAGGTC GTCAGGTTGG TGTTCCTTAC ATCATCGTAT 100
TCCTGAACAA ATGTGACATG GTAGATGATG AAGAGTTACT GGAATTAGTM 150
GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CAGGTGATGA 200
CACTCCAGTA ATCCGTGGTT CAGCGCTGAA AGCACTGGAA GGCGAAGCAG 250
AGTGGGAAGC AAAAATTGTT GAATTAGCAG AAGCACTGGA TTCATACATC 300
CCAGARCCAG AGCGTGCAAT TGACAAACCA TTCCTGTTAC CAATTGAAGA 350
CGTATTCTCA ATTTCAGGCC GTGGTACAGT AGTAACAGGT CGTGTTGAGC 400
GTGGCGTAAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATTAAACCA 450
ACAGCGAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGTA AATTACTTGA 500
CGAAGGTCGT GCAGGTGAGA ACGTAGGTGT TCTTCTGCGT GGTACTAAAC 550
GTGAAGAAAT CGAACGTGGA CAAGTACTGG CGAAACCAGG TTCAATCAAC 600
CCACACACTA AATTTGAATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 650
TGGTCGTCAT ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700
GTACAACTGA CGTAACTGGT ACTATCGAAT TACCAGAAGG CGTAGAAATG 750
GTAATGCCAG GTGACAACAT CAACATGATC GTTGAACTGA TTCACCCAAT 800 
CGCGATGGAC GACGGTTTAC GTTTCGCTA 829 



2) INFORMATION FOR SEQ ID NO: 148

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 824 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus vulgaris 

(B) STRAIN: ATCC 13315 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148 



CGGAGCTATT CTGGTTGTTG CTGCGACTGA TGGCCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGTTAGGT CGCCAGGTTG GTGTACCTTA CATCATCGTA 100 

TTCCTGAACA AATGTGACAT GGTTGATGAT GAAGAACTGC TGGAATTAGT 150

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGTGATG 200

ACACTCCAGT AATCCGTGGT TCAGCGCTGA AAGCACTGGA AGGCGAAGCT 250

GAGTGGGAAG CAAAAATTGT TGAATTAGCA GAAGCACTGG ATTCTTACAT 300 

CCCAGAACCA GAGCGTGCAA TTGACAAACC ATTCCTGCTG CCTATCGAAG 350 

ACGTATTCTC AATCTCTGGT CGTGGTACAG TAGTAACAGG CCGTGTAGAG 400

CGTGGTGTTG TTAAAGTTGG TGAAGAAGTT GAGATTGTTG GTATTAAAGA 450 

CACAGTTAAA ACAACTTGTA CTGGCGTTGA AATGTTCCGT AAATTACTTG 500 

ACGAAGGTCG TGCAGGTGAG AACGTAGGTG TTCTTCTGCG TGGTACTAAA 550 

CGTGAAGAAA TCGAACGTGG ACAAGTACTG GCTAAACCAG GTTCAATCAA 600 

GCCACACACT AAATTCGAAT CAGAAGTTTA TATCCTGAGC AAAGATGAAG 650

GTGGTCGTCA CACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700

CGTACAACTG ACGTAACTGG TACTATCGAA TTACCAGAAG GCGTAGAAAT 750 

GGTAATGCCA GGTGACAACA TCAACATGAT CGTTGAACTG ATTCACCCTA 800 

TCGCGTAGGA CGACGGTTTA CGTT 824 

2) INFORMATION FOR SEQ ID NO: 149

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 745 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia alcalifaciens 
(B) STRAIN: ATCC 9886

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149 

CACAAACTCG TGAGCACATC CTGTTAGGTC GCCAAGTAGG TGTTCCTTAC 50 
ATCATCGTTT TCCTGAACAA ATGTGACATG GTAGACGACG AAGAACTGTT 100

AGAATTAGTT GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC 150

CAGGCGATGA CACTCCAGTT GTTCGCGGTT CAGCACTGAA AGCGCTGGAA 200 

GGCAACCCAG AGTGGGAAGC AAAAATTGTT GAATTAGCAG GTTACCTGGA 250 

TTCTTACATC CCAGAACCAG AGCGTGCAAT TGACAAGCCA TTCCTGCTGC 300 

CAATCGAAGA CGTATTCTCA ATCTCTGGTC GTGGTACAGT AGTAACAGGC 350

CGTGTTGAGC GTGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG 400

TATTCAAGCG ACTGCGAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGTA 450

AACTGCTGGA TGAAGGTCGT GCGGGTGAGA ACGTTGGTGT TCTGCTGCGT 500 

GGTACTAAAC GTGAAGAAAT TCAACGTGGT CAAGTACTGG CTAAACCAGG 550 

TTCAATCAAG CCACACACTC AATTCGAATC AGAAGTATAT ATTCTGAGCA 600

AAGATGAAGG TGGTCGTCAT ACTCCATTCT TCAAAGGCTA CCGTCCACAG 650

TTCTACTTCC GTACAACTGA CGTAACCGGT ACTATCGAAC TGCCAGAAGG 700

CGTAGAGATG GTAATGCCAG GCGACAACAT CAACATGATC GTGAC 745 






2) INFORMATION FOR SEQ ID NO: 150 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 830 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rettgeri 

(B) STRAIN: ATCC 9250 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150 

CGGTGCAATC CTGGTTGTTG CTGCGACTGA TGGCCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGTTAGGY CGCCAAGTAG GTGTWCCTTA CATCATCGTT 100 

TTCCTGAACA AATGTGACAT GGTAGACGAC GAAGAACTGT TAGAATTAGT 150

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA ATACGATTTC CCAGGCGACG 200 

ACACTCCAGT TGTCCGTGGT TCAGCTCTGA AAGCGCTGGA AGGCAACCCA 250 

GAGTGGGAAG CGAAAATTGT TGAATTAGCA GGTCACTTGG ATTCTTACAT 300

CCCAGAACCA GAGCGTGCAA TTGACAAACC ATTCCTGCTG CCAATCGAAG 350

ACGTATTCTC AATCTCTGGT CGTGGTACAG TAGTAACAGG CCGTGTTGAG 400

CGTGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCCAAGA 450 

CACGGTTAAA ACAACTTGTA CTGGCGTTGA AAYGTTCCGT luuCcT^TGCf 500

ACGAAGGTCG TGCGGGTGAG AACGTTGGTG TTCTGCTGCG TGGTACTAAA 550 

CGTGAAGAAA TTCAACGTGG TCAAGTACTG GCAAAACCAG GTTCAATCAA 600 

GCCACACACT AAATTCGAAT CAGAAGTCTA TATTCTGAGC AAAGATGAAG 650

GTGGTCGTCA CACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACAGG TACTATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGTGATAACA TCAACATGAT CGTTACCCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGTTTA CGTTTCGCAA 830 



2) INFORMATION FOR SEQ ID NO: 151 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rustigianii 
(B) STRAIN: ATCC 33673

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151 

CGGTGCAATC CTGGTTGTTG CTGCGACTGA TGGCCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGTTAGGT CGCCAAGTAG GTGTTCCTTA CATCATCGTT 100

TTCCTGAACA AATGTGACAT GGTTGACGAC GAAGAACTGT TAGAATTAGT 150

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGCGACG 200

ACACTCCAGT TGTTCGYGGT TCAGCACTGA AAGCGCTGGA AGGTATCCCT 250

GAGTGGGAAG CGAAAATTGT TGAATTAGCC GGTTACCTGG ACAGCTACAT 300

CCCAGAACCA GAGCGCGCAA TTGACCGTCC ATTCCTGCTG CCAATCGAAG 350

ACGTATTCTC AATCTCTGGT CGTGGTACAG TAGTAACAGG SCGTGTTGAG 400

CGTGGTATCG TTAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCCAAGA 450 

CACRGTTAAA ACAACTTGTA CTGGCGTTGA AATGTTCCGT AAACTGCTTG 500 

ACGAAGGTCG TGCTGGTGAG AACGTTGGTG TTTTACTGCG TGGTACTAAG 550 

CGTGAAGAAA TTCAACGTGG TCAAGTACTG GCTAAACCAG GTTCAATCAA 600

GCCACACACT ACTTTTGAAT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACCGG TACTATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGCGACAACA TCAACATGAT CGTGACACTG ATTCACCCAA 800 

TCGCGATGGA TGATGGTTTA CGTTTC 826



2) INFORMATION FOR SEQ ID NO: 152 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Providencia stuartii

(B) STRAIN: ATCC 33672 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152 

CGGTGCAATC CTAGTTGTTG CGGCAACAGA TGGCCCAATG CCACAAACTC 50

GTGAGCACAT CCTGTTAGGT CGTCAGGTTG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AATGTGACAT GGTAGACGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA ATACGATTTC CCAGGTGATG 200 

ACACTCCAGT TATCCGTGGT TCAGCGCTGA AAGCGTTGGA AGGCAACCCA 250 

GAGTGGGAAG CGAAAATCGT TGAACTAGCA GAAGCACTGG ACAGCTACAT 300

CCCAGAGCCA GAGCGTGCAA TTGACAAGCC ATTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC AATCTCAGGT CGTGGTACAG TAGTCACAGG CCGTGTTGAG 400 

CGTGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTAG GTATCAAAGA 450 

GACTGCGAAA ACCACTTGTA CTGGCGTTGA AATGTTCCGT AAACTGCTGG 500 

ACGAAGGCCG TGCGGGTGAG AACGTAGGTG TTCTGCTGCG TGGTACTAAG 550

CGTGAAGAAA TCGAACGTGG TCAAGTTCTG GCGAAACCAG GTTCAATCAA 600 

GCCACACACA ACTTTCGAAT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA CACGCCATTC TTCAAAGGYT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACAGG TACTATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGCGACAACG TGAACATGAA AGTAACTCTG ATTCACCCAA 800

TCGCGATGGA CGATGGTTTG CGTTTCGCAA 830



2) INFORMATION FOR SEQ ID NO: 153

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: ATCC 35554 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 



CCTGGTTTGC TCGGCTGCCG ACGGCCCCAT GCCGCAGACC CGCGAGCACA 50 

TCCTGCTGTC CCGCCAGGTA GGCGTTCCCT ACATCGTCGT GTTCCTGAAC 100 

AAAGCCGACA TGGTCGACGA CGCCGAGCTG CTGGAACTGG TCGAGATGGA 150 

AGTTCGCGAT CTGCTGAACA CCTACGACTT CCCGGGCGAC GACACTCCGA 200 

TCATCATCGG TTCCGCGCTG ATGGCGCTGG AAGGCAAGGA TGACAACGGC 250

ATCGGCGTAA GCGCCGTGCA GAAGCTGGTA GAGACCCTGG ACTCCTACAT 300

TCCGGAGCCG GTTCGTGCCA TCGACCAGCC GTTCCTCATG CCGATCGAAG 350

ACGTGTTCTC GATCTCCGGT CGCGGTACCG TGGTAACCGG TCGTGTAGAG 400

CGCGGCATCA TCAAGGTCCA GGAAGAAGTG GAAATCGTCG GCATCAAGGC 450

GACCACCAAG ACCACCTGCA CCGGCGTTGA AATGTTCCGC AAGCTGCTCG 500

ACGAAGGTCG TGCTGGTGAG AACGTTGGTA TCCTGCTGCG TGGCACCAAG 550 

CGTGAAGACG TAGAGCGTGG CCAGGTACTG GCCAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAGT GCGAAGTGTA CGTGCTGTCC AAGGAAGAAG 650 

GTGGTCGTCA CACCCCGTTC TTCAAGGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACCACKG ACGTGACCGG TAMCTGCGAG CTGCCGGAAG GCGTAGAGAT 750

GGTAATGCCG GGCGACAACA TCAAGATGGT TGTCACCCTG ATCGCTCCGA 800

TCGCGATGGA AGATGGCTGC GTTCGCG 827






2) INFORMATION FOR SEQ ID NO: 154 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 841 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas fluorescens 

(B) STRAIN: ATCC 13525 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154

CGGCGCAATC CTGGTTTGCT CGGCCGCTGA TGGTCCGATG CCACAAACCC 50 

GTGAACACAT CCTGCTGTCC CGTCAGGTTG GCGTTCCGTA CATCGTGGTT 100 

TACCTGAACA AGGCTGACCT GGTAGACGAC GCTGAGCTGC TGGAACTGGT 150 

TGAGATGGAA GTGCGCGATC TGCTGAGCAC TTACGACTTC CCAGGCGACG 200

ACACTCCGAT CATCATCGGT TCTGCTCGTA TGGCTCTGGA AGGCAAAGAC 250

GACAACGAAA TGGGCACCAC GTCCGTTCGT AAACTGGTTG AAACTCTGGA 300

CAGCTACATC CCAGATCCAG TTCGTGTTAT CGACAAGCCG TTCCTGATGC 350

CAATCGAAGA CGTGTTCTCG ATCTCCGGTC GCGGTACTGT TGTGACTGGT 400

CGTATCGAGC GCGGTATCGT TAAGGTTCAA GATCCACTGG AAATCGTTGG 450

TCTGCGTGAC ACTACCGTCA CCACCTGCAC CGGTGTTGAA ATGTTCCGTA 500

AGCTGCTCGA CGAAGGTCGT GCTGGCGAGA ACTGCGGCGT TCTGCTGCGT 550

GGTACCAAGC GTGACGACGT TGAGCGTGGC CAGGTTCTGG TTAAGCCAGG 600

TTCGGTTAAG CCGCACACCA AGTTCGAAGC TGAAGTCTAC GTACTGAGCA 650

AAGAAGAAGG CGGTCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCACAG 700

TTCTACTTCC GTACTACTGA CGTGACTGGT AACTGCGAGC TGCCGGAAGG 750

CGTTGAAATG GTTATGCCAG GCGACAACAT CAAAATGGTT GTTACCCTGA 800

TCAAAACCAT CGCAATGGAA GACGGTCTGC GTTTCGCTAT T 841



2) INFORMATION FOR SEQ ID NO: 155



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 841 bases
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas stutzeri 

(B) STRAIN: ATCC 17588 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155

CGGCGCGATC CTGGTCTGCT CGGCTGCTGA CGGCCCCATG CCGCAGACTC 50 

GCGAGCACAT CCTGCTGTCC CGTCAGGTTG GTGTTCCGTA CATCGTCGTG 100 

TTCCTGAACA AGGCCGACAT GGTTGATGAC GCCGAGCTGC TCGAGCTGGT 150 

CGAGATGGAA GTTCGYGACC TGCTGTCGAC CTACGACTTC CCGGGTGAYG 200

ACACTCCGAT CATCATCGGC TCCGCGCTGA TGGCGCTGAA CGGCGAAGAC 250 

GACAACGAGC TCGGCACCAC TGCGGTGAAG AAGCTGGTCG AGACCCTGGA 300 

CAGCTACATT CCCGAGCCGG TTCGTGCCAT CGACAAGCCG TTCCTGATGC 350

CGATCGAAGA CGTGTTCTCG ATCTCCGGTC GCGGCACSGT GGTAACCGGT 400

CGCGTAGAGC GCGGCATCGT CAAGGTTCAG GAAGAGATCG AGATCGTCGG 450

TCTGCGTCCG ACCACCAAGA CTACCTGCAC CGGCGTTGAG ATGTTCCGCA 500 

AGCTG^TCG A~ "y UARwffwff T^Ci^Oi Ab : A^ ACl^Ub r Y Ul^ TiCl^^lI^iT s^u 

GGCACCAAGC GTGACGAAGT GGAGCGTGGT CAGGTTCTGG CCAAGCCGGG 600 

CACCATCAAG CCGCACACCA AGTTCGAAGC CGAAGTGTAC GTGCTGTCCA 650 

AGGAAGAAGG TGGTCGTCAC ACCCCGTTCT TYAAGGGCTA CCGTCCKCAG 700

TTCTACTTCC GTACCACTGA YGTGACYGGW TCGTGCGARC TGCCGGAAGG 750 

CGTCGAGATG GTAATGCCGG GCGACAACGT GAAGATGGTT GTCACCCTGA 800 

TCAAGCCGAT CGCCATGGAA GACGGCCTGC GCTTCGCGAT T 841 






2) INFORMATION FOR SEQ ID NO: 156 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 833 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Psychrobacter phenylpyruvicus 

(B) STRAIN: ATCC 23333 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156 



GCTATTCTAG TAGTATCAGC AACTGACGGT CCAATGCCAC AAACACGTGA 50 

GCACATTCTA TTATCACGTC AGGTTGGTGT ACCATACATC ATCGTATTCA 100 

TGAACAAATG TGACATGGTA GATGACGAAG AGTTACTAGA GCTAGTAGAA 150

ATGGAAGTGC GTGAATTACT TTCAGACTAC GACTTCCCAG GTGATGACAC 200 

TCCAATCATC AAAGGTTCAG CTTTAGAAGC GTTAAATGGT AACGACGGTA 250 

AGTACGGTGA GCCAGCAGTT ATCGAACTAC TAAACACTCT AGACACTTAC 300 

ATTCCAGAGC CAGAGCGTGA CATCGATAAG CCATTCCTAA TGCCAATCGA 350 

AGACGTATTC TCAATCTCAG GTCGTGGTAC AGTAGTAACA GGCCGTGTTG 400

AATCTGGTAT CATCAAAGTT GGTGACGAAA TCGAAATCGT TGGTATCAAA 450 

GACACAGTTA AAACAACTTG TACTGGTATC GAGATGTTCC GTAAGTTACT 500 

AGACGAAGGT CGTGCTGGTG AGAACTGTGG TGTACTATTA CGTGGTACTA 550 

AGCGTGAAGA CGTACAACGT GGTCAAGTAC TTGCTAAGCC AGGTTCAATC 600 

ACTCCACACA CCAACTTCGA CGCAGAAGTA TACGTACTAT CAAAAGAAGA 650

AGGTGGTCGT CACACTCCAT TCTTAAATGG TTACCGTCCA CAGTTCTACT 700 

TCCGTACTAC TGACGTAACA GGTGCAATCA CGTTACAAGA AGGTACTGAA 750 

ATGGTAATGC CAGGCGATAA CGTTGAGATG AGCGTAGAGC TAATCCACCC 800 

AATCGCTAGG ACAAAGGTTT ACGTTTCGCA ATC 833 

2) INFORMATION FOR SEQ ID NO: 157

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rahnella aquatilis 
(B) STRAIN: ATCC 33071



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157

GGCGCTATCC TGGTTGTTGC TGCAACTGAC GGCCCTATGC CTCAGACTCG 50
TGAGCACATC CTGCTGGGTC GCCAGGTTGG CGTTCCATAC ATGATCGTGT 100
TCATGAACAA ATGCGACATG GTAGATGACG AAGAGCTGCT GGAACTGGTA 150
GAAATGGAAG TTCGCGAACT TCTGTCTGCT TACGAATTCC CAGGCGACGA 200
CATCCCGGTC ATCAAAGGTT CAGCGCTGAA AGCACTGGAA GGCGATGCTA 250
CTTGGGAAGC GAAAATCATC GAACTGGCAG AAGCACTGGA CAGCTACATT 300
CCATTGCCAG AGCGTGCTAT CGATAAGCCA TTCCTGCTGC CAATCGAAGA 350
CGTATTCTCC ATCTCCGGTC GTGGTACAGT GGTTACCGGT CGTGTAGAGC 400
GCGGTATCGT TAAAGTGGGC GAAGAAGTTG AAATCGTCGG TATCAAGGAC 450
ACTGTTAAGT CTACTTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500
CGAAGGCCGT GCGGGCGAGA ACGTGGGTGT TCTGCTGCGT GGTATCAAGC 550
GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCAATCAAA 600
CCACACACCA AGTTTGATTC CGAAGTGTAC ATCCTGAGCA AAGATGAAGG 650
TGGTCGTCAC ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700
GTACAACTGA CGTGACCGGT ACTATCGAAC TGCCAGAAGG CGTTGAGATG 750
GTTATGCCTG GTGACAACGT GAACATGGTT GTTACCCTGA TCCACCCAAT 800
CGCGATGGAT GACGGTCTGC GTTTC 825



2) INFORMATION FOR SEQ ID NO: 158

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Salmonella choleraesuis subsp. arizonae

(B) STRAIN: ATCC 13314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50

GTGAGCACAT CCTGCTGGGC CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ATACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ACTCTTACAT 300

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500

ACGAAGGCCG TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTGGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800

TCGCGATGGA CGACGGTCTG CGTTTCGCAA 830



2) INFORMATION FOR SEQ ID NO: 159

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 832 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis
serotype Choleraesuis

(B) STRAIN: ATCC 7001

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGYCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200

ACACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGTGACGCA 250

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400

CGCGGTATCA TCAAAGTGGG YGAAGAAGTT GAAATCGTTG GTATCAAAGA 450

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGTTGG 500 

ACGAAGGCCG TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700

CGTACTACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCAATGGA CGACGGTCTG CGTTTCGCAA TC 832 






2) INFORMATION FOR SEQ ID NO: 160 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 807 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Salmonella choleraesuis subsp. diarizonae

(B) STRAIN: ATCC 43973

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160



CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATTATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150

AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACGCCAA 200 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250 

GCGAAAATCA TCGAACTGGC TGGCTTCCTG GATTCTTACA TCCCGGAACC 300 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 400

ATCAAAGTGG GCGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 450 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAARCTGCTG GACGAAGGCC 500 

GTGCWGGTGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCACCATCA AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ATATTCTGTC CAAAGATGAA GGCGGCCGTC 650

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACCACT 700

GACGTGACTG GCACCATCGA ACTGCCGGAA GGCGTGGAGA TGGTAATGCC 750

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800

ACGACGG 807 

2) INFORMATION FOR SEQ ID NO: 161

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis
serotype Heidelberg

(B) STRAIN: ATCC 8326 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA YACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG CACCATCGAA TTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800

TCGCGATGGA CGACGGTCTG CGTTTCGCAA TC 832 



2) INFORMATION FOR SEQ ID NO: 162

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Salmonella choleraesuis subsp. houtenae 

(B) STRAIN: ATCC 43974 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162



CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGATTT CCCGGGCGAC GACACGCCGA 200 

TCGTGCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAATGGGAA 250

GCGAAAATCA TCGAACTGGC TGGCTACCTG GATTCTTACA TCCCGGAACC 300 

AGAGCGTGCG ATTGACAAAC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

CTATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 400 

ATCAAAGTGG GCGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 450 

GTCGACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500

GTGCTGGCGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCACCATCA AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ACATTCTGTC CAAAGATGAA GGCGGCCGTC 650 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AATTCTACTT CCGTACGACT 700 

GACGTGACTG GCACCATCGA ACTGCCGGAA GGCGTGGAGA TGGTAATGCC 750

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800

ACGACGG 807






2) INFORMATION FOR SEQ ID NO: 163 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 827 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. indica 

(B) STRAIN: ATCC 43976 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163 

CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACCCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTAGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTTGA 150 

AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGTGACGACA 200 

CGCCGATCGT GCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCAGAG 250 

TGGGAAGCGA AAATCATCGA ACTGGCTGGC TTCCTGGATT CTTACATTCC 300 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 350 

TATTCTCCAT CTCCGGTCGT GGTACYGTTG TTACCGGTCG TGTAGAGCGC 400

GGTATCATCA AAGTGGGCGA AGAAGTTGAA ATCGTTGGTA TCAAAGAGAC 450

TCAGAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC CGGTGAGAAC GTAGGTGTTC TGCTGCGTGG TATCAAACGT 550 

GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCA CCATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT TCTGTCCAAA GATGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCGCAGTT CTACTTCCGT 700 

ACTACTGACG TGACTGGCAC CATCGAACTG CCGGAAGGCG TAGAGATGGT 750 

AATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CATCCGATCG 800 

CRATGGACGA CGGTCTGCGT TTCGCAA 827 



2) INFORMATION FOR SEQ ID NO: 164 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. salamae 

(B) STRAIN: ATCC 43972 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 

CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 
TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 
AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 
AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACGCCGA 200

TCGTGCGTGG TTCCGCTCTG AAAGCGCTGG AAGGCGAMGC TGAGTGGGAA 250

GMGAAAATCA TCGAACTGGC TGGCTWCCTG GATTCTTACA TTCCGGAACC 300 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 400

ATCAAAGTGG GCGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 450

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 

GTGCCGGTGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCACCATCA AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ACATTCTGTC CAAAGATGAA GGCGGCCGTC 650

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACCACT 700 

GACGTGACTG GCACCATCGA ACTGCCGGAA GGCGTGGAGA TGGTAATGCC 750 

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 

ACGACGG 807



2) INFORMATION FOR SEQ ID NO: 165 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis
serotype Typhi

(B) STRAIN: ATCC 10749 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCNGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACY AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800

TCGCAATGGA CGACGGTCTG CGTTTCGCAA TC 832 



2) INFORMATION FOR SEQ ID NO: 166

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia fonticola 

(B) STRAIN: DSM 4576 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166 



CGGCGCTATC CTGGTTGTAG CTGCGACTGA CGGCCCTATG CCTCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTTG GCGTTCCTTA CATCATCGTA 100 

TTCATGAACA AATGCGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTGC TTATGACTTC CCTGGTGATG 200 

ACCTGCCGGT TGTTCGTGGT TCAGCGCTGA AAGCACTGGA AGGCGAAGCT 250

GAGTGGGAAG CTAAAATCAT CGAGCTGGCC GGTCACCTGG ATTCCTACAT 300

CCCAGAACCA GAGCGTGCTA TCGATCAGCC GTTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TAGTTACCGG TCGTGTTGAG 400

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

CACCGTTAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAG 550 

CGTGAAGACA TCGAACGTGG TCAGGTACTG GCTAAACCAG GTTCCATCAA 600 

GCCGCACACT CAGTTCGATT CAGAAGTGTA TATCCTGAGC AAAGAAGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTAGAGAT 750

GGTAATGCCA GGCGATAACG TGAACATGGT TGTTACCCTG ATCCACCCAA 800 

TCGCTATGGA CCAAGGC 817 






2) INFORMATION FOR SEQ ID NO: 167 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia liquefaciens 

(B) STRAIN: ATCC 27592 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167 



GCTGCGACTG ACGGCCCAAT GCCTCAGACC CGTGAGCACA TCCTGCTGGG 50

TCGTCAGGTT GGCGTTCCTT TCATCATCGT ATTCATGAAC AAATGCGACA 100

TGGTTGATGA TGAAGAGCTG CTGGAACTGG TAGAAATGGA AGTTCGTGAA 150

CTTCTGTCTG CTTACGACTT CCCTGGTGAT GACCTGCCGG TTGTTCGTGG 200

TTCAGCGCTG AAAGCACTGG AAGGCGAAGC TGAGTGGGAA GCTAAAATCA 250

TCGAGCTGGC CGGTTACCTG GATTCTTACA TCCCAGAACC AGAGCGTGCT 300

ATCGACAAGC CGTTCCTGCT GCCAATCGAA GACGTCTTCT CCATCTCCGG 350

TCGTGGTACC GTTGTTACCG GTCGTGTTGA GCGCGGTATC GTTAAAGTTG 400

GCGAAGAAGT TGAAATCGTT GGTATCAAAG ACACCGTTAA GTCTACCTGT 450

ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC GTGCTGGTGA 500

GAACGTTGGT GTTCTGCTGC GTGGTATCAA GCGTGAAGAC ATCGAACGTG 550

GTCAGGTACT GGCTAAACCA GGTTCAATCA AGCCACACAC CAAGTTCGAC 600

TCAGAAGTGT ACATCCTGAG CAAAGAAGAA GGTGGTCGTC ATACTCCATT 650

CTTCAAAGGC TACCGTCCAC AGTTCTACTT CCGTACAACT GACGTGACCG 700

GTACCATCGA ACTGCCAGAA GGCGTTGAAA TGGTAATGCC AGGTGACAAC 750

GTGAACATGG TTGTTACCCT GATCCACCCA ATCGCGA 787



2) INFORMATION FOR SEQ ID NO: 168 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 745 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 

(B) STRAIN: ATCC 13880 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168 

GCCTCAGACT CGTGAGCACA TCCTGCTGGG TCGTCAGGTT GGCGTTCCTT 50 

TCATCATCGT ATTCATGAAC AAATGCGACA TGGTTGATGA TGAAGAGCTG 100 

YTGGAACTGG TAGAAATGGA AGTTCGCGAA CTGCTGTCCG CTTACGACTT 150 

CCCTGGCGAC GACCTGCCGG TAATCCGCGG TTCCGCGCTG AAAGCGCTGG 200 

AAGGCGAAGC TGAGTGGGAA GCGAAAATCA TCGAACTGGC CGAAGCCCTG 250

GACAGCTACA TCCCAGAGCC AGAGCGTGCT ATCGACAAGC CGTTCCTGCT 300 

GCCAATCGAA GACGTATTCT CCATCTCCGG TCGTGGTACC GTTGTTACCG 350 

GTCGTGTTGA GCGCGGCATC ATCAAAGTTG GCGAAGAAGT TGAAATCGTT 400 

GGTATCAAAG ACACCGTTAA GTCTACCTGT ACTGGCGTTG AAATGTTCCG 450

CAAACTGCTG GACGAAGGCC GTGCTGGTGA GAACGTAGGT GTTCTGCTGC 500 

GTGGTATCAA ACGTGAAGAA ATCGAACGTG GTCAGGTACT GGCTAAGCCA 550 

GGCTCCATCA AGCCGCACAC CCAGTTCGAA TCTGAAGTGT ACATCCTGAG 600 

CAAAGATGAA GGTGGTCGTC ACACKCCATT CTTCAAAGGC TACCGTCCAC 650 

AGTTCTACTT CCGTACCACT GACGTGACCG GTACCATCGA ACTGCCAGAA 700 

GGCGTAGAGA TGGTAATGCC AGGCGACAAC GTGAACATGG TTGTA 745 



2) INFORMATION FOR SEQ ID NO: 169 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia odorifera 

(B) STRAIN: ATCC 33077 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169 

GGCGCAATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GCCAGGTTGG CGTTCCTTTC ATCATCGTGT 100 

TCATGAACAA ATGTGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGCGAGCT GCTGTCTGCT TACGATTTCC CTGGCGACGA 200 

CCTGCCAGTA ATCCGCGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCAG 250 

AGTGGGAAGC TAAGATTGTA GAACTGGCTG AAGCGCTGGA TTCTTACATC 300

CCAGAACCAG AGCGTGCTAT CGACAAGCCG TTCCTGCTGC CAATCGAAGA 350

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTTGAGC 400

GCGGTATCAT CAAAGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAC 450

ACCGTTAAGT CTACCTGTAC CGGTGTAGAA ATGTTCCGCA AACTGCTGGA 500

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAGC 550

GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCTATCAAG 600

CCGCACACCA AATTCGACTC AGAAGTGTAC ATCCTGAGCA AAGAAGAAGG 650

TGGTCGTCAC ACGCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700

GTACTACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 750

GTAATGCCAG GCGATAACGT GAACATGGTT GTTACCCTGA TTCACCCAAT 800

CGCAATGGAC GACGGTCTGC GTTTCGCAA 829






2) INFORMATION FOR SEQ ID NO: 170 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 830 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia plymuthica 

(B) STRAIN: DSM 4540 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170 

CGGCGCAATC CTGGTTGTTG CTGCGACTGA CGGCCCAATG CCTCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTTG GCGTTCCTTT CATCATCGTA 100 

TTCATGAACA AATGCGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTGC TTACGACTTC CCTGGTGATG 200 

ACCTGCCGGT TGTTCGTGGT TCAGCGCTGA AAGCACTGGA AGGCGAACCA 250 

GAGTGGGAAG CTAAAATCAT CGAGCTGGCT GGTTTCCTGG ATTCTTACAT 300 

CCCAGAACCA GAGCGTGCTA TCGACAAGCC GTTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTTGAG 400 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTGG GTATCAAAGA 450 

CACCGTTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTGGGTG TTCTGCTGCG TGGTATCAAG 550 

CGCGAAGATA TCGAACGTGG TCAGGTCCTG GCTAAACCAG GTTCAATCAA 600 

GCCACACACC AAGTTTGACT CAGAAGTGTA CATCCTGAGC AAAGAAGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGTGACAACG TGAACATGGT TGTAACCCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGCCTG CGTTTCGCAA 830 



2) INFORMATION FOR SEQ ID NO: 171 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Serratia rubidaea 

(B) STRAIN: ATCC 27593 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171 



GGCGCAATCC TGGTAGTAGC AGCGACTGAC GGCCCAATGC CTCAGACCCG 50

TGAGCACATC CTGCTGGGCC GCCAGGTAGG CGTACCTTAC ATCATCGTAT 100

TCATGAACAA ATGCGACATG GTAGATGATG AAGAGCTGCT GGAACTGGTA 150

GAGATGGAAG TTCGCGAACT GCTGTCTGCT TACGACTTCC CAGGCGACGA 200

CCTGCCGGTA ATCCGTGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250

AGTGGGAAGC GAAAATCGTT GAGCTGGCAG AAGCGCTGGA CAGCTACATC 300

CCAGAGCCAG AGCGTGCTGT AGACAAGCCG TTCCTGCTGC CAATCGAAGA 350

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400

GCGGTATCAT CAAAGTTGGT GAAGAAGTAG AAATCGTAGG TATCAAAGAC 450

ACCGTTAAGT CTACCTGTAC TGGCGTAGAA ATGTTCCGCA AACTGCTGGA 500

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAGC 550

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CGAAGCCAGG TTCAATCAAG 600

CCGCACACCC AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 750

GTAATGCCAG GCGACAACGT GAACATGAAA GTTACTCTGA TTCACCCAAT 800

CGCAATGGAC GACGGTCTGC GTTTCGCAA 829






2) INFORMATION FOR SEQ ID NO: 172

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella boydii 
(B) STRAIN: ATCC 9207

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172 

CGGCGCGATC YTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250

GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTACAT 300

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800

TCGCGATGGA CGACGGTCTG CGTTTC 826






WO 01/23604 



PCT/CA00/01150



2) INFORMATION FOR SEQ ID NO: 173 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Shigella dysenteriae

(B) STRAIN: ATCC 11835 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173 

TGGTAGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG TGAGCACATC 50

CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA 100 

ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG 150 

TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC 200 

GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 250 

GAAAATCCTG GAACTGGCTG GCTTCCTGGA TTCYTAYATT CCGGAACCAG 300

AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 350 

ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT 400

CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACYCAGAAGT 450

CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 500 

GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 550

CGAACGTGGT CAGGTACTGG CGAAGCCRGG CACCATCAAG CCGCACACCA 600 

AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT 650 

ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA 700 

CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 750 

GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT CGCGATGGAC 800

GACGGTCTGC GTTTCGCA 818 



2) INFORMATION FOR SEQ ID NO: 174

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Shigella flexneri 

(B) STRAIN: ATCC 12022 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174 



CCTGGTAGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TTGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACTCCGA 200 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250

GCGAAAATCC TGGAACTGGC TGGCTTCCTG GATTCTTACA TTCCGGAACC 300

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA ACGCGGTATC 400

ATCAAAGTTG GTGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 450

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500

GTGCTGGTGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCACCATCA AGCCGCACAC 600

CAAGTTCGAA TCTGAAGTGT ACATTCTGTC CAAAGATGAA GGCGGMCGTC 650

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACTACT 700

GACGTGACTG GTACCATCGA ACTGCCGGAA GGCGTAGAGA TGGTAATGCC 750

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800

ACGACG 806



2) INFORMATION FOR SEQ ID NO: 175 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella sonnei 

(B) STRAIN: ATCC 29930 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175 

CGGCGCGATC CTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTACAT 300

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGYCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA TC 832 



2) INFORMATION FOR SEQ ID NO: 176 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 716 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 13301 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176 



TCTGCTGCTG ACGGTCCAAT GCCACAAACT CGTGAACACA TTCTTTTATC 50

ACGTAACGTT GGTGTACCAG CATTAGTAGT ATTCTTAAAC AAAGTTGACA 100

TGGTTGACGA TGAAGAATTA TTAGAATTAG TAGAAATGGA AGTTCGTGAC 150

TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG TAATCGCTGG 200

TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA GAAAAAATCT 250

TAGAATTAAT GGAAGCTGTA GATACTTACA TTCCAACTCC AGAACGTGAT 300

TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT CAATCACTGG 350

TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA ATCAAAGTTG 400

GTGAAGAAGT TGAAATCATC GGTTTACATG ACACATCTAA AACAACTGTT 450

ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG AAGCTGGTGA 500

CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC GTACAACGTG 550

GTCAAGTATT AGCTGCTCCT GGTTCAATTA CACCACATAC TGAATTCAAA 600

GCAGAAGTAT ACGTATTATC AAAAGACGAA GGTGGACGTC ACACTCCATT 650

CTTCTCAAAC TATCGTCCAC AATTCTATTT CCGTACTACT GACGTAACTG 700

GTGTTGTTCA CTTACC 716



2) INFORMATION FOR SEQ ID NO: 177 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 719 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 29247 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177 

TTCTTTTATC ACGTAACGTT GGTGTACCAG CATTAGTAGT ATTCTTAAAC 50 

AAAGTTGACA TGGTTGACGA TGAAGAATTA TTAGAATTAG TAGAAATGGA 100 

AGTTCGTGAC TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG 150 

TAATCGCTGG TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA 200 

GAAAAAATCT TAGAATTAAT GGAAGCTGTA GATACTTACA TTCCAACTCC 250 

AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 300 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 350 

ATCAAAGTTG GTGAAGAAGT TGAAATCATC GGTTTACATG ACACATCTAA 4 00 

AACAACTGTT ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG 450 

AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC 500 

GTACAACGTG GTCAAGTATT AGCTGCTCCT GGTTCAATTA CACCACATAC 550 

TGAATTCAAA GCAGAAGTAT ACGTATTATC AAAAGACGAA GGTGGACGTC 600 

ACACTCCATT CTTCTCAAAC TATCGTCCAC AATTCTATTT CCGTACTACT 650 

GACGTAACTG GTGTTGTTCA CTTACCAGAA GGTMCTGAAA TGGTAATGCC 700 

TGGTGATAAC GTTGAAATG 719 



2) INFORMATION FOR SEQ ID NO: 178 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 625 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 33591 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 50 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 100 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 150 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 200 

CAATACGAAG AAAAAATCTT AGAATTAATG GAAGCTGTAG ATACTTACAT 250 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 300 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 350 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 400 

CACATCTAAA ACAACTGTTA CAGGTGTTGA AATGTTCCGT AAATTATTAG 450 

ACTACGCTGA AGCTGGTGAC AACATTGGTG CATTATTACG TGGTGTTGCT 500 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 550 

ACCACATACT GAATTCAAAG CAGAAGTATA CGTATTATCA AAAGACGAAG 600 

GTGGACGTCA CACTCCATTC TTCTC 625



2) INFORMATION FOR SEQ ID NO: 179 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 704 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 43300

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179 

GTTGGTGTAC CAGCATTAGT AGTATTCTTA AACAAAGTTG ACATGGTTGA 50 

CGATGAAGAA TTATTAGAAT TAGTAGAAAT GGAAGTTCGT GACTTATTAA 100 

GCGAATATGA CTTCCCAGGT GACGATGTAC CTGTAATCGC TGGTTCAGCA 150 

TTAAAAGCTT TAGAAGGCGA TGCTCAATAC GAAGAAAAAA TCTTAGAATT 200 

AATGGAAGCT GTAGATACTT ACATTCCAAC TCCAGAACGT GATTCTGACA 250

AACCATTCAT GATGCCAGTT GAGGACGTAT TCTCAATCAC TGGTCGTGGT 300

ACTGTTGCTA CAGGCCGTGT TGAACGTGGT CAAATCAAAG TTGGTGAAGA 350 

AGTTGAAATC ATCGGTTTAC ATGACACATC TAAAACAACT GTTACAGGTG 400

TTGAAATGTT CCGTAAATTA TTAGACTACG CTGAAGCTGG TGACAACATT 450 

GGTGCATTAT TACGTGGTGT TGCTCGTGAA GACGTACAAC GTGGTCAAGT 500 

ATTAGCTGCT CCTGGTTCAA TTACACCACA TACTGAATTC AAAGCAGAAG 550 

TATACGTATT ATCAAAAGAC GAAGGTGGAC GTCACACTCC ATTCTTCTCA 600 

AACTATCGTC CACAATTCTA TTTCCGTACT ACTGACGTAA CTGGTGTTGT 650 

TCACTTACCA GAAGGTACTG AAATGGTAAT GCCTGGTGAT AACGTTGAAA 700

TGAC 704






2) INFORMATION FOR SEQ ID NO: 180 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 730 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus subsp. aureus 

(B) STRAIN: ATCC 6538 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 50 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 100 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 150 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 200 

CAATACGAAG AAAAAATCTT AGAATTAATG GAAGCTGTAG ATACTTACAT 250 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 300 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 350 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 400 

CACATCTAAA ACAACTGTTA CAGGTGTTGA AATGTTCCGT AAATTATTAG 450 

ACTACGCTGA AGCTGGTGAC AACATTGGTG CATTATTACG TGGTGTTGCT 500 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 550 

ACCACATACT GAATTCAAAG CAGAAGTATA CGTATTATCA AAAGACGAAG 600 

GTGGACGTCA CACTCCATTC TTCTCAAACT ATCGTCCACA ATTCTATTTC 650 

CGTACTACTG ACGTAACTGG TGTTGTTCAC TTACCAGAAG GTACTGAAAT 700 

GGTAATGCCT GGTGATAACG TTGAAATGAC 730 



2) INFORMATION FOR SEQ ID NO: 181 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus auricularis 

(B) STRAIN: ATCC 33753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181 

CGGTGCGATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACCA AGTTGACGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATACGACTAC CCAGGTGACG 200 

ATGTACCTGT AATCTCTGGT TCTGCGTTGA AAGCATTAGA AGGCGACAAA 250 

GAATACGAAC AAAAAATCTT AGACTTAATG CAACAAGTTG ACGATTACAT 300

TCCAACTCCA GAACGTGACT CTGATAAACC ATTCATGATG CCAGTTGAAG 350

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCAACAGG CCGTGTTGAA 400

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATCATCG GTATGAAAGA 450

CGGTTCACAA AAAACAACAG TTACTGGTGT AGAAATGTTC CGTAAATTAT 500

TAGACTACGC TGAAGCTGGT GACAACATCG GTGCTTTATT ACGTGGTATT 550

TCACGTGAAG AAGTACAACG TGGTCAAGTT TTAGCTGCTC CTGGTTCAAT 600

TACACCACAC ACTAAATTCA CTGCAGAAGT TTACGTATTA TCTAAAGATG 650

AAGGTGGACG TCACACTCCA TTCTTCTCTA ACTACCGTCC ACAATTCTAT 700

TTCCGTACTA CTGACGTAAC AGGTGTTGTT ACTTTACCAG AAGGTACAGA 750

AATGGTAATG CCTGGCGATA ACGTTAAAAT GGAAGTTGAA TTAATTTCTC 800

CAATCGCTAT CGAAGACGGT ACTCGTTTCT CAAT 834






2) INFORMATION FOR SEQ ID NO: 182 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus capitis subsp. capitis 

(B) STRAIN: ATCC 27840 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182 

CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGATG 200 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 250 

CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGATTACAT 300 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTATCCACGA 450 

AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATCAC 600 

ACCACACACT AAATTCAAAG CGGAAGTTTA CGTTTTATCT AAAGACGAAG 650 

GTGGACGTCA CACTCCATTC TTCAGTAACT ACCGCCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACTGAAAT 750 

GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCTA 800 

TCGCTATTGA AGACGGTACT CGTTTCTCAA TCGGA 835 



2) INFORMATION FOR SEQ ID NO: 183 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Macrococcus caseolyticus 

(B) STRAIN: ATCC 13548

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183 

GTATCTTAGT AGTATCTGCT GCTGACGGTC CAATGCCACA AACTCGTGAA 50 

CACATCCTTT TATCACGTAA CGTTGGTGTA CCAGCATTAG TAGTATTCTT 100 

GAACAAAGTT GACATGGTTG ACGATGAAGA ATTATTAGAA TTAGTTGAAA 150

TGGAAGTTCG TGACTTATTA TCTGAATATG ACTTCCCTGG TGACGATGTA 200 

CCTGTAATCG CTGGATCTGC TTTAAAAGCA TTAGAAGGCG TTGAAGAATA 250 

CGAAGACAAA ATCATGGAAT TAATGGACGC AGTTGATGAG TACATCCCAA 300 

CTCCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT TGAGGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACAGTTGCA ACTGGACGTG TTGAGCGTGG 400

ACAAGTTAAA GTTGGTGAAG AAGTTGAAAT CATTGGTTTA ACTGAAGAAC 450 

CAGCAAAAAC TACAGTTACA GGTGTAGAAA TGTTCCGTAA ATTATTAGAT 500 

TACGCTGAAG CTGGAGATAA CATCGGTGCT TTATTACGTG GTGTTTCTCG 550 

TGAAGACGTA CAACGTGGAC AAGTATTAGC TAAACCAGGT TCAATTACTC 600 

CACATACTAA ATTCAAAGCT GAAGTTTACG TATTATCTAA AGAAGAAGGT 650

GGACGTCATA CTCCATTCTT CACTAACTAC CGCCCTCAGT TCTACTTCCG 700 

TACAACTGAC GTAACTGGTG TAGTTAACTT ACCAGAAGGT ACTGAAATGG 750 

TAATGCCTGG AGATAACATC GAAATGAACG TTGAATTAAT TTCTCCAATC 800 

GCGA 804 




2) INFORMATION FOR SEQ ID NO: 184 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 
(B) STRAIN: DSM 20260

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 50 

GTGAACATAT CCTTTTATCA CGTAACGTTG GTGTTCCAGC ATTAGTTGTA 100

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 200 

ATGTACCTGT AATCTCTGGT TCAGCATTAA AAGCTCTTGA AGGCGACGCT 250 

GACTATGAGC AAAAAATCTT AGACTTAATG CAAGCTGTTG ATGACTTCAT 300

TCCAACACCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG GCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATCATCG GTATGCAAGA 450

AGATTCAAGC AAAACAACTG TTACTGGTGT AGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCTGGT GACAACATTG GTGCGTTATT ACGTGGTGTT 550 

GCACGTGAAG ACATCCAACG TGGTCAAGTT TTAGCTGCTC CTGGTTCAAT 600

TACACCACAC ACAAACTTTA AAGCGGAAGT TTACGTTTTA TCAAAAGATG 650 

AAGGTGGCCG TCATACGCCA TTCTTCAGTA ACTATCGCCC ACAATTCTAT 700 

TTCCGTACTA CTGACGTAAC AGGTGTTGTT ACTTTACCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAGAAAT GGAAGTTGAA CTAATTTCTC 800 

CAATCGCTAT CGAAGACGGT ACACGTTTCT CT 832



2) INFORMATION FOR SEQ ID NO: 185

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 699 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: CSG 269 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185 

ATCTGCTGCT GACGGTCCAA TGCCACAAAC TCGTGAACAC ATCTTATTAT 50 

CACGTAACGT TGGTGTACCA GCATTAGTTG TATTCTTAAA CAAAGTTGAC 100 

ATGGTAGACG ACGAAGAATT ATTAGAATTA GTTGAAATGG AAGTTCGTGA 150 

CTTATTAAGC GAATATGACT TCCCAGGTGA CGATGTACCT GTAATCGCTG 200 

GTTCTGCATT AAAAGCATTA GAAGGCGATG CTGAATACGA ACAAAAAATC 250

TTAGACTTAA TGCAAGCAGT TGATGATTAC ATTCCAACTC CAGAACGTGA 300 

TTCTGACAAA CCATTCATGA TGCCAGTTGA GGACGTATTC TCAATCACTG 350 

GTCGTGGTAC TGTTGCTACA GGCCGTGTTG AACGTGGTCA AATCAAAGTT 400 

GGTGAAGAAG TTGAAATCAT CGGTATGCAC GAAACTTCTA AAACAACTGT 450 

TACTGGTGTA GAAATGTTCC GTAAATTATT AGACTACGCT GAAGCTGGTG 500 

ACAACATCGG TGCTTTATTA CGTGGTGTTG CACGTGAAGA CGTACAACGT 550 

GGTCAAGTAT TAGCTGCTCC TGGTTCTATT ACACCACACA CAAAATTCAA 600 

AGCTGAAGTA TACGTATTAT CTAAAGATGA AGGTGGACGT CACACTCCAT 650 

TCTTCACTAA CTATCGCCCA CAATTCTATT TCCGTACTAC TGACGTAAC 699 



2) INFORMATION FOR SEQ ID NO: 186 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186 



CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 100

TTCTTAAATA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150

TGAAATGGAA GTACGTGACT TATTATCTGA ATACGACTTC CCAGGTGACG 200

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 250

CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 300

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400



CGTGGGCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATTG X2¥KTCC#KA 450



CACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500

ACTACGCTGA AGCTGGTGAC AACATCGGTG CATTATTACG TGGTGTTGCT 550

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCAG GTTCAATCAC 600

ACCTCACACA AAATTTAAAG CAGACGTATA CGTTTTATCT AAAGACGAAG 650

GTGGACGTCA CACTCCATTC TTCACAAACT ATCGTCCACA ATTCTATTTC 700

CGTACTACTG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACTGAAAT 750

GGTTATGCCT GGCGACAACG TTGAAATGAC AGTAGAATTA ATCGCTCCTA 800

TCGCGATTGA AGACGGTACT CGTTTCTCA 829



2) INFORMATION FOR SEQ ID NO: 187

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 705 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Staphylococcus warneri

(B) STRAIN: CSG 123

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187

CACAAACTCG TGAACACATT CTTTTATCAC GTAACGTTGG TGTACCAGCT 50

TTAGTTGTAT TCTTAAACAA AGTTGATATG GTAGACGACG AAGAATTATT 100

AGAATTAGTA GAAATGGAAG TTCGTGACTT ATTATCTGAA TATGACTTCC 150

CAGGTGACGA CGTACCTGTA ATCGCTGGTT CAGCATTAAA AGCTTTAGAA 200

GGCGACGAAA AATACGAAGA AAAAATCTTA GAATTAATGC AAGCAGTTGA 250

TGACTACATT CCAACTCCAG AACGTGATTC TGACAAACCA TTCATGATGC 300

CAGTTGAGGA CGTATTCTCA ATCACTGGTC GTGGTACTGT TGCTACAGGC 350

CGTGTTGAAC GTGGTCAAAT CAAAGTTGGT GAAGAAGTTG AAATCATCGG 400

TTTACATGAC ACTTCTAAAA CAACTGTTAC TGGTGTAGAA ATGTTCCGTA 450

AGTTATTAGA CTACGCTGAA GCTGGTGACA ACATCGGTGC TTTATTACGT 500

GGTGTTGCTC GTGAAGACGT ACAACGTGGT CAAGTATTAG CTGCTCCTGG 550

TTCAATTACA CCACATACAA AATTCAAAGC GGAAGTTTAC GTTTTATCTA 600

AAGACGAAGG TGGACGTCAC ACTCCATTCT TCAGTAACTA CCGCCCACAA 650

TTCTATTTCC GTACTACTGA CGTAACTGGC GTTGTTCAAT TACCAGAAGG 700

TACTG 705



2) INFORMATION FOR SEQ ID NO: 188

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 678 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Staphylococcus haemolyticus

(B) STRAIN: CSG 23

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188

TTTTATCACG TAACGTTGGT GTACCAGCAT TAGTAGTATT CTTAAACAAA 50

GTTGACATGG TTGACGATGA AGAATTATTA GAATTAGTTG AAATGGAAGT 100

ACGTGACTTA TTATCTGAAT ACGACTTCCC AGGTGACGAC GTACCTGTAA 150

TCGCTGGTTC AGCTTTAAAA GCTTTAGAAG GCGATGCTCA ATACGAAGAA 200

AAAATCTTAG AATTAATGCA AGCAGTTGAT GATTACATTC CAACTCCAGA 250

ACGTGACTCT GATAAACCAT TCATGATGCC AGTTGAGGAC GTATTCTCAA 300

TCACTGGTCG TGGTACTGTT GCTACAGGTC GTGTTGAACG TGGTCAAATC 350

AAAGTTGGTG AAGAAGTTGA AATTATTGGT ATCAAAGAAA CTTCTAAAAC 400

AACTGTTACT GGTGTAGAAA TGTTCCGTAA ATTATTAGAC TACGCTGAAG 450

CTGGTGACAA CATCGGTGCT TTATTACGTG GTGTTGCTCG TGAAGATGTA 500

CAACGTGGTC AAGTATTAGC TGCTCCAGGT TCAATTACAC CTCACACAAA 550

ATTCAAAGCA GACGTATACG TTTTATCAAA AGATGAAGGT GGACGTCATA 600

CTCCATTCTT CACTAACTAT CGTCCACAAT TCTATTTCCG TACTACTGAC 650

GTAACTGGTG TTGTTAACTT ACCAGAAG 678



2) INFORMATION FOR SEQ ID NO: 189 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: CSG 33 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189 

ACCAGCATTA GTAGTATTCT TAAATAAAGT TGACATGGTT GACGATGAAG 50

AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGATGT ACCTGTAATC GCTGGTTCAG CATTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 200

CAGTTGATGA CTACATTCCA ACTCCAGAAC GTGATTCTGA CAAACCATTC 250 

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 300 

TACAGGCCGT GTTGAACGTG GTCAAATCAA AGTTGGTGAA GAAGTTGAAA 350 

TCATTGGTAT CCATGACACT TCTAAAACAA CTGTTACTGG TGTAGAAATG 400

TTCCGTAAAT TATTAGACTA CGCTGAAGCT GGTGACAACA TCGGTGCATT 450

ATTACGTGGT GTTGCTCGTG AAGACGTACA ACGTGGTCAA GTATTAGCTG 500 

CTCCAGGTTC AATCACACCT CACACAAAAT TTAAAGCAGA CGTATACGTT 550 

TTATCTAAAG ACGAAGGTGG ACGTCACACT CCATTCTTCA CAAACTATCG 600

TCCACAATTC TATTTCCGTA CTACTGACGT AACTGGTGTT GTTAACTTAC 650 

CAGAAGGTAC TGAAATGG 668 



2) INFORMATION FOR SEQ ID NO: 190 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 593 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: CSG 8 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190 



AAAGTTGACA TGGTTGACGA TGAAGAATTA TTAGAATTAG TTGAAATGGA 50

AGTACGTGAC TTATTATCTG AATACGACTT CCCAGGTGAC GATGTACCTG 100

TAATCGCTGG TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA 150

GAAAAAATCT TAGAATTAAT GCAAGCAGTT GATGATTACA TTCCAACTCC 200

AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 250

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 300

ATCAAAGTTG GTGAAGAAGT TGAAATCATT GGTATCCATG ACACTTCTAA 350

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 400

AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC 450

GTACAACGTG GTCAAGTATT AGCTGCTCCA GGTTCAATCA CACCTCACAC 500

AAAATTTAAA GCAGACGTAT ACGTTTTATC TAAAGACGAA GGTGGACGTC 550

ACACTCCATT CTTCACAAAC TATCGTCCAC AATTCTATTT CCG 593






2) INFORMATION FOR SEQ ID NO: 191 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 828 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis subsp. hominis 

(B) STRAIN: ATCC 27844 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 



CGGCGCTATC TTAGTAGTAT CTGCTGCTGA TGGTCCAATG CCACAAACTC 50

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 100

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150

TGAAATGGAA GTACGTGACT TATTATCTGA ATACGACTTC CCAGGTGACG 200

ACGTACCTGT AATCGCTGGT TCAGCTTTAA AAGCTTTAGA AGGCGATGCT 250

CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGATTATAT 300

TCCAACTCCA GAACGTGACT CTGATAAACC ATTCATGATG CCAGTTGAGG 350

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATTATTG GTATCAAAGA 450

AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500

ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550

CGTGAAGATG TACAACGTGG TCAAGTATTA GCTGCTCCAG GTTCAATTAC 600

ACCTCACACA AAATTCAAAG CAGACGTATA CGTTTTATCA AAAGATGAAG 650

GTGGACGTCA TACTCCATTC TTCTCTAACT ATCGTCCACA ATTCTATTTC 700

CGTACTACTG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACTGAAAT 750

GGTAATGCCT GGTGACAACG TTGAAATGAC AGTAGAATTA ATCGCTCCTA 800

TCGCGATTGA AGACGGTACT CGTTTCTC 828



2) INFORMATION FOR SEQ ID NO: 192

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 620 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Staphylococcus warneri

(B) STRAIN: ATCC 35982



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192 



ATGGTCCAAT GCCACAAACT CGTGAACACA TTCTTTTATC ACGTAACGTT 50 

GGTGTACCAG CTTTAGTTGT ATTCTTAAAC AAAGTTGATA TGGTAGACGA 100 

CGAAGAATTA TTAGAATTAG TAGAAATGGA AGTTCGTGAC TTATTATCTG 150 

AATATGACTT CCCAGGTGAC GACGTACCTG TAATCGCTGG TTCAGCATTA 200 

AAAGCTTTAG AAGGCGACGA AAAATACGAA GAAAAAATCT TAGAATTAAT 250

GCAAGCAGTT GATGACTACA TTCCAACTCC AGAACGTGAT TCTGACAAAC 300 

CATTCATGAT GCCAGTTGAG GACGTATTCT CAATCACTGG TCGTGGTACT 350 

GTTGCTACAG GCCGTGTTGA ACGTGGTCAA ATCAAAGTTG GTGAAGAAGT 400

TGAAATCATC GGTTTACATG ACACTTCTAA AACAACTGTT ACTGGTGTAG 450

AAATGTTCCG TAAGTTATTA GACTACGCTG AAGCTGGTGA CAACATCGGT 500

GCTTTATTAC GTGGTGTTGC TCGTGAAGAC GTACAACGTG GTCAAGTATT 550 

AGCTGCTCCT GGTTCAATTA CACCACATAC AAAATTCAAA GCGGAAGTTT 600 

ACGTTTTATC TAAAGACGAA 620 



2) INFORMATION FOR SEQ ID NO: 193 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 692 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 170 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193 



CCAGCATTAG TAGTATTCTT AAACAAAGTT GACATGGTTG ACGATGAAGA 50 

ATTATTAGAA TTAGTTGAAA TGGAAGTACG TGACTTATTA TCTGAATACG 100 

ACTTCCCAGG TGACGACGTA CCTGTAATCG CTGGTTCAGC TTTAAAAGCT 150

TTAGAAGGCG ATGCTCAATA CGAAGAAAAA ATCTTAGAAT TAATGCAAGC 200 

AGTTGATGAT TATATTCCAA CTCCAGAACG TGACTCTGAT AAACCATTCA 250 

TGATGCCAGT TGAGGACGTA TTCTCAATCA CTGGTCGTGG TACTGTTGCT 300 

ACAGGCCGTG TTGAACGTGG TCAAATCAAA GTTGGTGAAG AAGTTGAAAT 350 

TATTGGTATC AAAGAAACTT CTAAAACAAC TGTTACTGGT GTAGAAATGT 400

TCCGTAAATT ATTAGACTAC GCTGAAGCTG GTGACAACAT CGGTGCTTTA 450 

TTACGTGGTG TTGCTCGTGA AGATGTACAA CGTGGTCAAG TATTAGCTGC 500 

TCCAGGTTCA ATTACACCTC ACACAAAATT CAAAGCAGAC GTATACGTTT 550 

TATCAAAAGA TGAAGGTGGA CGTCATACTC CATTCTTCTC TAACTATCGT 600 

CCACAATTCT ATTTCCGTAC TACTGACGTA ACTGGTGTTG TTAACTTACC 650

2) INFORMATION FOR SEQ ID NO: 194

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 684 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Staphylococcus hominis

(B) STRAIN: CSG 36



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194 

CATTCTTTTA TCACGTAACG TTGGTGTACC AGCATTAGTA GTATTCTTAA 50

ACAAAGTTGA CATGGTTGAC GATGAAGAAT TATTAGAATT AGTTGAAATG 100

GAAGTACGTG ACTTATTATC TGAATACGAC TTCCCAGGTG ACGACGTACC 150

TGTAATCGCT GGTTCAGCTT TAAAAGCTTT AGAAGGCGAT GCTCAATACG 200

AAGAAAAAAT CTTAGAATTA ATGCAAGCAG TTGATGATTA TATTCCAACT 250

CCAGAACGTG ACTCTGATAA ACCATTCATG ATGCCAGTTG AGGACGTATT 300

CTCAATCACT GGTCGTGGTA CTGTTGCTAC AGGCCGTGTT GAACGTGGTC 350

AAATCAAAGT TGGTGAAGAA GTTGAAATTA TTGGTATCAA AGAAACTTCT 400

AAAACAACTG TTACTGGTGT AGAAATGTTC CGTAAATTAT TAGACTACGC 450

TGAAGCTGGT GACAACATCG GTGCTTTATT ACGTGGTGTT GCTCGTGAAG 500

ATGTACAACG TGGTCAAGTA TTAGCTGCTC CAGGTTCAAT TACACCTCAC 550

ACAAAATTCA AAGCAGACGT ATACGTTTTA TCAAAAGATG AAGGTGGACG 600

TCATACTCCA TTCTTCTCTA ACTATCGTCC ACAATTCTAT TTCCGTACTA 650

CTGACGTAAC TGGTGTTGTT AACTTACCAG AAGG 684



2) INFORMATION FOR SEQ ID NO: 195 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis

(B) STRAIN: CSG 6

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195 



ACCAGCATTA GTAGTATTCT TAAACAAAGT TGACATGGTT GACGATGAAG 50

AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100

GACTTCCCAG GTGACGACGT ACCTGTAATC GCTGGTTCAG CTTTAAAAGC 150

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 200

CAGTTGATGA TTACATTCCA ACTCCAGAAC GTGACTCTGA TAAACCATTC 250

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 300

TACAGGCCGT GTTGAACGTG GTCAAATCAA AGTTGGTGAA GAAGTTGAAA 350

TTATTGGTAT CAAAGAAACT TCTAAAACAA CTGTTACTGG TGTAGAAATG 400

TTCCGTAAAT TATTAGACTA CGCTGAAGCT GGTGACAACA TCGGTGCTTT 450

ATTACGTGGT GTTGCTCGTG AAGATGTACA ACGTGGTCAA GTATTAGCTG 500

CTCCAGGTTC AATTACACCT CACACAAAAT TCAAAGCAGA CGTATACGTT 550

TTATCAAAAG ATGAAGGTGG ACGTCATACT CCATTCTTCA CTAACTATCG 600

TCCACAATTC TATTTCCGTA CTACTGACGT AACTGGTGTT GTTAACTTAC 650

CAGAAGGTAC TGAAATGGTA ATGCCTGGCG ACAAC 685



2) INFORMATION FOR SEQ ID NO: 196 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 611 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 62 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196 

GACTTATTAT CTGAATACGA CTTCCCAGGT GACGACGTAC CTGTAATCGC 50 

TGGTTCAGCT TTAAAAGCTT TAGAAGGCGA TGCTCAATAC GAAGAAAAAA 100 

TCTTAGAATT AATGCAAGCA GTTGATGATT ACATTCCAAC TCCAGAACGT 150 

GACTCTGATA AACCATTCAT GATGCCAGTT GAGGACGTAT TCTCAATCAC 200 

TGGTCGTGGT ACTGTTGCTA CAGGCCGTGT TGAACGTGGT CAAATCAAAG 250 

TTGGTGAAGA AGTTGAAATT ATTGGTATCA AAGATACTTC TAAAACAACT 300 

GTTACTGGTG TAGAAATGTT CCGTAAATTA TTAGACTACG CTGAAGCTGG 350 

TGACAACATC GGTGCTTTAT TACGTGGTGT TGCTCGTGAA GATGTACAAC 400

GTGGTCAAGT ATTAGCTGCT CCAGGTTCAA TCACACCTCA CACAAAATTC 450 

AAAGCAGACG TATATGTTTT ATCAAAAGAT GAAGGTGGAC GTCATACTCC 500 

ATTCTTCACT AACTATCGTC CACAATTCTA TTTCCGTACT ACTGACGTAA 550 

CTGGTGTTGT TAACTTACCA GAAGGTACTG AAATGGTAAT GCCTGGCGAC 600 

AACGTTGAAA T 611 



2) INFORMATION FOR SEQ ID NO: 197 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus lugdunensis 

(B) STRAIN: ATCC 43809 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197 

CGGCGGTATC TTAGTAGTTT CTGCTGCAGA TGGTCCAATG CCACAAACTC 50

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTGCCAGC ATTAGTAGTA 100

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150

AGAAATGGAA GTTCGTGATT TATTAACTGA ATATGACTTC CCAGGTGACG 200

ATGTGCCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA 250

AAATACGAAG CTAAAATCTT AGAATTAATG GATGCAGTTG ATAACTACAT 300

TCCAACTCCA GAACGTGACT CTGACAAACC ATTCATGATG CCAGTTGAGG 350

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATTATTG GTATCCACGA 450

TACTACTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500

ACTACGCTGA AGCTGGTGAC AACATCGGTG CGTTATTACG TGGTGTTGCT 550

CGTGAAGATG TACAACGTGG ACAAGTATTA GCTGCTCCAG GTTCAATTAC 600

ACCTCACACT AAATTTAAAG CTGACGTATA TGTTTTATCT AAAGATGAAG 650

GTGGACGTCA TACACCATTC TTCTCAAACT ACCGCCCACA ATTCTATTTC 700

CGTACTACAG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACAGAAAT 750

GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCAA 800

TCGCTATCGA AGACGGAACT CGTTTCTC 828



2) INFORMATION FOR SEQ ID NO: 198

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 690 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Staphylococcus saprophyticus

(B) STRAIN: ATCC 35552 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198 

AGTAGTATCT GCTGCTGATG GCCCAATGCC ACAAACTCGT GAACACATTC 50

TTTTATCACG TAACGTTGGT GTTCCAGCAT TAGTTGTATT CTTAAACAAA 100 

GTTGACATGG TTGACGATGA AGAATTATTA GAATTAGTAG AAATGGAAGT 150 

TCGTGACTTA TTAAGCGAAT ATGACTTCCC AGGTGACGAT GTACCTGTAA 200 

TCTCTGGTTC TGCATTAAAA GCTTTAGAAG GCGACGCTGA CTATGAGCAA 250 

AAAATCTTAG ACTTAATGCA AGCTGTTGAT GACTTCATTC CAACACCAGA 300

ACGTGATTCT GACAAACCAT TCATGATGCC AGTTGAGGAC GTATTCTCAA 350

TCACTGGTCG TGGTACTGTT GCTACAGGCC GTGTTGAACG TGGTCAAATC 400

AAAGTCGGTG AAGAAATCGA AATCATCGGT ATGCAAGAAG AATCAAGCAA 450

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 500 

AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTTC ACGTGATGAC 550

GTACAACGTG GTCAAGTTTT AGCTGCTCCT GGTACTATTA CACCACATAC 600

AAAATTCAAA GCGGATGTTT ACGTTTTATC TAAAGATGAA GGTGGTCGTC 650

ATACACCATT CTTCACTAAC TACCGCCCAC AATTCTATTT 690 



2) INFORMATION FOR SEQ ID NO: 199

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 723 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Staphylococcus saprophyticus

(B) STRAIN: CSG 83



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199 



GCATTAGTTG TATTCTTAAA CAAAGTTGAC ATGGTTGACG ATGAAGAATT 50

ATTAGAATTA GTAGAAATGG AAGTTCGTGA TTTATTAAGC GAATATGACT 100

TCCCAGGTGA CGATGTACCT GTAATCTCTG GTTCTGCATT AAAAGCTTTA 150

GAAGGCGACG CTGACTATGA GCAAAAAATC TTAGACTTAA TGCAAGCTGT 200

TGATGACTTC ATTCCAACAC CAGAACGTGA TTCTGACAAA CCATTCATGA 250

TGCCAGTTGA GGACGTATTC TCAATCACTG GTCGTGGTAC TGTTGCTACA 300

GGCCGTGTTG AACGTGGTCA AATCAAAGTC GGTGAAGAAA TCGAAATCAT 350

CGGTATGCAA GAAGAATCAA GCAAAACAAC TGTTACTGGT GTAGAAATGT 400

TCCGTAAATT ATTAGACTAC GCTGAAGCTG GTGACAACAT TGGTGCATTA 450

TTACGTGGTG TTTCACGTGA TGACGTACAA CGTGGTCAAG TTTTAGCTGC 500

TCCTGGTACT ATTACACCAC ATACAAAATT CAAAGCGGAT GTTTACGTTT 550

TATCTAAAGA TGAAGGTGGT CGTCATACAC CATTCTTCAC TAACTACCGC 600

CCACAATTCT ATTTCCGTAC TACTGACGTA ACTGGTGTTG TTAACTTACC 650

AGAAGGTACT GAAATGGTTA TGCCTGGCGA TAACGTTGAA ATGGATGTTG 700

AATTAATTTC TCCAATCGCT ATT 723






2) INFORMATION FOR SEQ ID NO: 200 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 697 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSsa 18 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200



CGTTGGTGTT CCAGCATTAG TTGTATTCTT AAACAAAGTT GACATGGTTG 50

ACGATGAAGA ATTATTAGAA TTAGTAGAAA TGGAAGTTCG TGACTTATTA 100

AGCGAATATG ACTTCCCAGG TGACGATGTA CCTGTAATCT CTGGTTCTGC 150

ATTAAAAGCT TTAGAAGGCG ACGCTGACTA TGAGCAAAAA ATCTTAGACT 200

TAATGCAAGC TGTTGATGAC TTCATTCCAA CACCAGAACG TGATTCTGAC 250

AAACCATTCA TGATGCCAGT TGAGGACGTA TTCTCAATCA CTGGTCGTGG 300

TACTGTTGCT ACAGGCCGTG TTGAACGTGG TCAAATCAAA GTCGGTGAAG 350

AAATCGAAAT CATCGGTATG CAAGAAGAAT CAAGCAAAAC AACTGTTACT 400

GGTGTAGAAA TGTTCCGTAA ATTATTAGAC TACGCTGAAG CTGGTGACAA 450

CATTGGTGCA TTATTACGTG GTGTTTCACG TGATGACGTA CAACGTGGTC 500

AAGTTTTAGC TGCTCCTGGT ACTATTACAC CACATACAAA ATTCAAAGCG 550

GATGTTTACG TTTTATCTAA AGATGAAGGT GGTCGTCATA CACCATTCTT 600

CACTAACTAC CGCCCACAAT TCTATTTCCG TACTACTGAC GTAACTGGTG 650

TTGTTAACTT ACCAGAAGGT ACTGAAATGG TTATGCCTGG CGATAAC 697



2) INFORMATION FOR SEQ ID NO: 201 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus sciuri subsp. sciuri 

(B) STRAIN: ATCC 29060 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201 

CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCCAATG CCTCAAACTC 50

GTGAGCACAT TCTTTTATCA CGTAACGTAG GTGTTCCTGC ATTAGTAGTA 100 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGCGACG 200 

ACGTTCCTGT AATTGCTGGT TCAGCATTAA AAGCATTAGA AGGCGACGAA 250 

GCTTACGAAG ACAAAATCAT GGAATTAATG GATGCTGTTG ATACATTCAT 300 

CCCAACTCCA GAACGTGACT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCACTGTTGG TGAAGAAGTT GAAATCATCG GTTTAACTGA 450 

AGAATCTTCT AAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGACTTCGC TGAAGCTGGA GATAACATCG GTGCATTATT ACGTGGTGTT 550 

GCTCGTGAAG ACGTTAACCG TGGTCAAGTA TTAGCTAAAC CAGGTTCAAT 600 

CACACCTCAC ACTAAATTCA AAGCTGAAGT TTATGTATTA TCTAAAGACG 650 

AAGGTGGACG TCATACTCCA TTCTTCACAA ACTACCGCCC ACAATTCTAT 700 

TTCCGTACTA CTGACGTAAC TGGTGTAGTT AACTTACCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGACA ACGTTGAAAT GGACGTTGAA TTAATTTCAC 800 

CAATCGCTAT TGAAGACGGT ATCGTTTCTC AATCA 835 



2) INFORMATION FOR SEQ ID NO: 202 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 27836 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202 



CGGCGGTATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100

TTCTTAAACA AAGTTGATAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150

AGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGTGACG 200

ACGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA 250

AAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 300

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 450

CACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAGTTATTAG 500

ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 600

ACCACATACA AAATTCAAAG CGGAAGTTTA CGTTTTATCT AAAGACGAAG 650

GTGGACGTCA CACTCCATTC TTCAGTAACT ACCGCCCACA ATTCTATTTC 700

CGTACTACTG ACGTAACTGG CGTTGTTCAA TTACCAGAAG GTACTGAAAT 750

GGTTATGCCT GGTGATAACG TTGAAATGAC AGTAGAATTA ATCGCTCCTA 800

TCGCGATTGA AGACGGTACT CGTTTCTCAA C 831






2) INFORMATION FOR SEQ ID NO: 203 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: CSG 50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203

CGGCGGTATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100 

TTCTTAAACA AAGTTGATAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGTGACG 200

ACGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA 250

AAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 300

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 450

CACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAGTTATTAG 500 

ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 600 

ACCACATACA AAATTCAAAG CGGAAGTTTA CGTTTTATCT AAAGACGAAG 650 

GTGGACGTCA CACTCCATTC TTCAGTAACT ACCGCCCACA ATTCTATTTC 700

CGTACTACTG ACGTAACTGG CGTTGTTCAA TTACCAGAAG GTACTGAAAT 750 

GGTTATGCCT GGTGATAACG TTGAAATGAC AGTAGAATTA ATCGCTCCTA 800 

TCGCGATTGA AGACGGTACT CGTTTCTCA 829



2) INFORMATION FOR SEQ ID NO: 204 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 839 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium longum 

(B) STRAIN: ATCC 15707 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204

TGGCGCTATC CTCGTTGTGG CCGCCACCGA CGGCCCGATG GCCCAGACTC 50

GCGAGCACGT GCTGCTCGCC CGTCAGGTTG GCGTTCCGAA GATCCTCGTC 100

GCCCTGAACA AGTGCGACAT GGTCGACGAT GAAGAGCTCA TCGAGCTCGT 150

CGAAGAAGAG GTCCGCGACC TCCTCGACGA GAACGGCTTC GACCGTGACT 200

GCCCGGTCAT CCACACCTCC GCTTACGGTG CTCTGCACGA CGACGCTCCG 250

GACCACGAGA AGTGGGTCCA GTCCGTTAAG GACCTCATGG ACGCTGTCGA 300

CGACTACATC CCGACCCCGG TTCACGACCT GGACAAGCCG TTCCTGATGC 350

CGATCGAGGA CGTCTTCACC ATCTCCGGCC GTGGTACCGT TGTCACCGGT 400

CGTGTCGAGC GTGGCCAGCT GGCCGTCAAC ACCCCGGTCG AGATCGTTGG 450

TATCCGTCCG ACCCAGCAGA CCACCGTCAC CTCCATCGAG ACCTTCCACA 500

AGACCATGGA CGCCTGCGAG GCTGGCGACA ACACCGGTCT GCTTCTGCGT 550

GGTCTCGGCC GTGACGATGT CGAGCGTGGC CAGGTTGTGG CCAAGCCGGG 600

CTCCGTCACC CCGCACACCA AGTTCGAGGG CGAAGTCTAC GTGCTGACCA 650

AGGACGAAGG CGGCCGTCAC TCGCCGTTCT TCTCCAACTA CCGTCCGCAG 700

TTCTACTTCC GCACCACCGA CGTCACCGGC GTCATCGAGC TGCCGGAAGG 750

CGTCGAGATG GTTCAGCCGG GCGACCACGC TACCTTCACC GTTGAGCTGA 800

TTCAGCCCAT CGCTATGGAG GAAGGCCTGA CCTTCGCTG 839



2) INFORMATION FOR SEQ ID NO: 205 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 754 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stenotrophomonas maltophilia 

(B) STRAIN: CDC F3338 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205 

CGGCGCGATC CTGGTGTGCT CGGCCGCTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGTCG CGCCAGGTCG GCGTGCCGTA CATCGTCGTG 100 

TTCCTGAACA AGGCCGACAT GGTCGACGAC GCCGAGCTGC TCGAGCTGGT 150 

CGAGATGGAA GTGCGCGAAC TGCTGAGCAA GTACGAGTTC CCGGGCGACG 200

ACACCCCGAT CATCGCCGGT TCGGCCCGCC TGGCGCTGGA AGGCGACCAG 250

AGCGACATCG GCGTGCCGGC CATCCTGAAG CTGGTCGACG CGCTGGACAG 300 

CTGGATTCCG GAGCCGGAGC GTGCGATCGA CAAGCCGTTC CTGATGCCGG 350 

TGGAAGACGT GTTCTCGATC TCGGGCCGCG GCACCGTGGT GACCGGTCGT 400

ATCGAGCGCG GCGTGATCAA GGTTGGCGAC GAAATCGAAA TCGTCGGCAT 450 

CCGTCCGGTG CAGAAGACCA CCGTGACCGG CGTTGAAATG TTCCGCAAGC 500 

TGCTGGACCA GGGTCAGGCA GGCGACAACG CTGGCCTGCT GCTGCGCGGC 550 

ACCAAGCGTG ATGACGTCGA GCGTGGCCAG GTGCTGGCCA AGCCGGGCAC 600 

GATCAAGCCG CACACCAAGT TCGAAGGCGA AGTGTACGTC CTGTCGAAGG 650 

ACGAGGGCGG CCGCCACACC CCGTTCTTCA ACGGCTACCG TCCGCAGTTC 700 

TACTTCCGCA CCACCGACAT CACCGGCGCC GCTGCACTGC CGGAAGGCGT 750 

CGAA 754 



2) INFORMATION FOR SEQ ID NO: 206

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 835 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus acidominimus

(B) STRAIN: ATCC 51726

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206 

TGGTGCTATC CTTGTAGTAG CTTCAACTGA CGGACCAATG CCACAAACTC 50 

GTGAGCACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAAAA CCTTATCGTT 100

TTCATGAACA AAGTTGACCT TGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGACC TTCTTTCAGA ATACGATTTC CCAGGTGATG 200 

ATCTTCCAGT TGTTCAAGGT TCAGCTCTTA AAGCGCTTGA AGGTGATTCA 250

GCACAAGAAG ATGTTATCAT GGAATTGATG TCAATCGTTG ACACATACAT 300 

TCCAGAACCA GAACGTGATA CTGACAAACC ATTGCTTCTT CCAGTCGAGG 350

ATGTATTCTC AATCACTGGA CGTGGTACTG TTGCTTCAGG ACGTATCGAC 400 

CGTGGTACTG TTAAAGTTAA TGACGAAGTT GAAATCGTTG GTATCAAAGA 450 

CGAAATCTCT AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 500 

TTGACGAAGG TCTTGCTGGA GATAACGTTG GTGTTCTTCT TCGTGGTGTA 550 

CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600

CAACCCACAC ACTAAATTCA AAGGTGAAGT TTACGTTCTT TCTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCTTCGATA ACTACCGTCC TCAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTTCAATC AAATTGCCAG AAGGTACTGA 750 

AATGGTAATG CCTGGTGATA ACGTAACTAT CGAAGTTGAG TTGATCCACC 800 

CAATCGCCGT TGAACAAGGT ACTACTTTCT CTATC 835



2) INFORMATION FOR SEQ ID NO: 207 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus agalactiae

(B) STRAIN: ATCC 12403 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207 

CTATCCTTGT AGTTGCTTCA ACTGATGGAC CAATGCCACA AACTCGTGAG 50

CACATCCTTC TTTCACGTCA AGTTGGTGTT AAACACCTTA TCGTATTCAT 100 

GAACAAAGTT GACCTTGTTG ATGATGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATTCG TGACCTTCTT TCAGAATACG ACTTCCCAGG TGATGACCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCA CTTGAAGGCG ACGAAAAATA 250 

CGAAGACATC ATCATGGAAT TGATGAGCAC TGTTGATGAG TACATTCCAG 300

AACCAGAACG TGATACTGAC AAACCTTTAC TTCTTCCAGT TGAAGATGTA 350

TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400

TACTGTTCGT GTCAACGACG AAGTTGAAAT CGTTGGTATT AAAGAAGATA 450

TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

GAAGGTCTTG CAGGGGACAA CGTTGGTGTT CTTCTTCGTG GTGTTCAACG 550

TGATGAAATC GAACGTGGTC AAGTTCTTGC TAAACCAGGT ^tt&YMttfl? 600

CACACACTAA ATTTAAAGGT GAAGTTTACA TCCTTTCTAA AGAAGAAGGT 650

GGACGTCATA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700

TACAACTGAC GTAACAGGTT CAATCGAACT TCCAGCAGGA ACAGAAATGG 750

TTATGCCTGG TGATAACGTT ACTATCGAAG TTGAATTGAT TCACCCAATC 800

GCCGTAGAAC AAGGTACTA 819



2) INFORMATION FOR SEQ ID NO: 208 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae

(B) STRAIN: ATCC 12973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208

CTATCCTTGT AGTTGCTTCA ACTGATGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA AGTTGGTGTT AAACACCTTA TCGTATTCAT 100 

GAACAAAGTT GACCTTGTTG ATGATGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATTCG TGACCTTCTT TCAGAATACG ACTTCCCAGG TGATGACCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCA CTTGAAGGCG ATGAAAAATA 250 

CGAAGACATC ATCATGGAAT TGATGAGCAC TGTTGATGAG TACATTCCAG 300 

AACCAGAACG TGATACTGAC AAACCTTTAC TTCTTCCAGT CGAAGATGTA 350 

TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TACTGTTCGT GTCAACGACG AAGTTGAAAT CGTTGGTATT AAAGAAGATA 450 

TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

GAAGGTCTTG CAGGGGACAA CGTTGGTGTT CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGTC AAGTTCTTGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTTAAAGGT GAAGTTTACA TCCTTTCTAA AGAAGAAGGT 650 

GGACGTCATA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

TACAACTGAC GTAACAGGTT CAATCGAACT TCCAGCAGGA ACAGAAATGG 750 

TTATGCCTGG TGATAACGTT ACTATCGAAG TTGAATTGAT TCACCCAATC 800 

GCCGTAGAAC AAGGTACTA 819 



2) INFORMATION FOR SEQ ID NO: 209 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 13813 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209

AGCTATCCTT GTAGTTGCTT CAACTGATGG ACCAATGCCA CAAACTCGTG 50

AGCACATCCT TCTTTCACGT CAAGTTGGTG TTAAACACCT TATCGTATTC 100 

ATGAACAAAG TTGACCTTGT TGATGATGAA GAATTGCTTG AATTGGTTGA 150 

AATGGAAATT CGTGACCTTC TTTCAGAATA CGACTTCCCA GGTGATGACC 200

TTCCAGTTAT CCAAGGTTCA GCTCTTAAAG CACTTGAAGG CGATGAAAAA 250

TACGAAGACA TCATCATGGA ATTGATGAGC ACTGTTGATG AGTACATTCC 300 

AGAACCAGAA CGTGATACTG ACAAACCTTT ACTTCTTCCA GTCGAAGATG 350 

TATTCTCAAT CACTGGACGT GGTACAGTTG CTTCAGGACG TATCGACCGT 400

GGTACTGTTC GTGTCAACGA CGAAGTTGAA ATCGTTGGTA TTAAAGAAGA 450

TATCCAAAAA GCAGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG 500 

ACGAAGGTCT TGCAGGGGAC AACGTTGGTG TTCTTCTTCG TGGTGTTCAA 550 

CGTGATGAAA TCGAACGTGG TCAAGTTCTT GCTAAACCAG GTTCAATCAA 600 

CCCACACACT AAATTTAAAG GTGAAGTTTA CATCCTTTCT AAAGAAGAAG 650 

GTGGACGTCA TACTCCATTC TTCAACAACT ACCGTCCACA ATTCTACTTC 700

CGTACAACTG ACGTAACAGG TTCAATCGAA CTTCCAGCAG GAACAGAAAT 750 

GGTTATGCCT GGTGATAACG TTACTATCGA AGTTGAATTG ATTCACCCAA 800 

TCGCCGTAGA ACAAGGTACT AC 822



2) INFORMATION FOR SEQ ID NO: 210 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus agalactiae

(B) STRAIN: CDCss-1073

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210



CGGAGCTATC CTTGTAGTTG CTTCAACTGA TGGACCAATG CCACAAACTC 50 

GTGAGCACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAACA CCTTATCGTA 100 

TTCATGAACA AAGTTGACCT TGTTGATGAT GAAGAATTGC TTGAATTGGT 150

TGAAATGGAA ATTCGTGACC TTCTTTCAGA ATACGACTTC CCAGGTGATG 200 

ACCTTCCAGT TATCCAAGGT TCAGCTCTTA AAGCACTTGA AGGCGACGAA 250 

AAATACGAAG ACATCATCAT GGAATTGATG AGCACTGTTG ATGAGTACAT 300

TCCAGAACCA GAACGTGATA CTGACAAACC TTTACTTCTT CCAGTTGAAG 350

ATGTATTCTC AATCACTGGA CGTGGTACAG TTGCTTCAGG ACGTATCGAC 400

CGTGGTACTG TTCGTGTCAA CGACGAAGTT GAAATCGTTG GTATTAAAGA 450

AGATATCCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 500 

TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTTCTTCT TCGTGGTGTT 550 

CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600 

CAACCCACAC ACTAAATTTA AAGGTGAAGT TTACATCCTT TCTAAAGAAG 650

AAGGTGGACG TCATACTCCA TTCTTCAACA ACTACCGTCC ACAATTCTAC 700

TTCCGTACAA CTGACGTAAC AGGTTCAATC GAACTTCCAG CAGGAACAGA 750 

AATGGTTATG CCTGGTGATA ACGTTACTAT CGAAGTTGAA TTGATTCACC 800 

CAATCGCCGT AGAACAAGGT ACTAC 825 




2) INFORMATION FOR SEQ ID NO: 211

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 826 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 33397 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211 

GGAGCTATCC TTGTAGTAGC TTCAACTGAC GGACCAATGC CTCAAACTCG 50 

TGAACACATC CTTCTTTCAC GCCAAGTAGG TGTTAAATAC CTTATCGTCT 100

TCATGAATAA AGTTGACTTG GTTGACGATG AAGAATTGCT TGAATTGGTT 150 

GAAATGGAAA TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA 200 

AATCCCAGTT ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATGAAA 250 

AATATGAAGA CATCATCATG GAATTGATGG ATACTGTTGA TGAATACATT 300 

CCAGAACCAG AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTTGAAGA 350 

TGTATTCTCA ATTACTGGAC GTGGTACTGT TGCTTCAGGA CGTATCGACC 400 

GTGGTACTGT TAAAGTCAAC GACGAAGTTG AAATCGTTGG TATCCGTGAT 450 

GAAATCCAAA AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAATT 500 

GGACGAAGGT CTTGCTGGAG ATAACGTAGG GGTTCTTCTT CGTGGTATCC 550 

AACGTGACGA AATCGAACGT GGACAAGTTC TTGCTAAACC AGGTTCAATT 600 

CATCCACACA CTAAATTCAA AGGTGAAGTT TACATCCTTA CTAAAGAAGA 650 

AGGTGGACGT CATACTCCAT TCTTCAACAA CTACCGTCCT CAATTCTACT 700 

TCCGTACTAC AGACGTTACA GGTTCAATCG AACTTCCTGC AGGTACTGAA 750 

ATGGTAATGC CTGGTGATAA CGTAACAATC GACGTTGAAT TGATCCACCC 800 

AATTGCCGTA GAACAAGGAA CTACAT 826



2) INFORMATION FOR SEQ ID NO: 212 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus bovis 

(B) STRAIN: ATCC 33317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212 

TGGTGCTATC CTTGTAGTAG CTTCTACAGA TGGTCCAATG CCACAAACAC 50 

GTGAACACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAACA CCTTATCGTC 100 

TTCATGAACA AAGTTGACCT TGTTGATGAC GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGACC TTCTTTCAGA ATATGATTTC CCAGGTGATG 200 

AAATCCCTGT AATCCAAGGT TCAGCTCTTA AAGCCCTTGA AGGTGACACT 250 

CACTACGAAG ACATCATCAT GGAATTGATG AACACTGTAG ATGAATACAT 300

TCCAGAACCA AAACGTGATA CTGACAAACC ATTGCTTCTT CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TAGCATCAGG ACGTATCGAC 400

CGTGGTACTG TTAAAGTCAA CGACGAAGTT GAAATCGTTG GTATCCGTGA 450

CGACATCCAA AAAGCTGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 500 

TTGATGAAGG TATCGCAGGG GATAACGTTG GTGTTCTTCT TCGTGGTATC 550

CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600

CCACCCACAC ACTAAATTCA AAGGTGAAGT TTACATCCTT ACTAAAGAAG 650

AAGGTGGACG TCACACTCCA TTCTTCAACA ACTACCGTCC TCAATTCTAC 700

TTCCGTACAA CTGACGTTAC AGGTTCAATC GAACTTCCAG CAGGTACTGA 750

AATGGTAATG CCTGGTGATA ACGTTACTAT CGACGTTGAA TTGATTCACC 800

CAATCGCCGT TGAACAAGGT ACTACAT 827



2) INFORMATION FOR SEQ ID NO: 213 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 821 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus anginosus (deposited as Streptococcus constellatus)

(B) STRAIN: ATCC 27823 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213 

GCTATCCTCG TAGTAGCTTC AACTGATGGA CCAATGCCTC AAACTCGTGA 50 

ACATATCCTT CTTTCACGTC AAGTAGGTGT TAAATACCTT ATCGTCTTCA 100 

TGAACAAAGT TGACTTGGTT GACGATGAAG AATTGCTTGA ATTGGTTGAA 150 

ATGGAAATCC GTGACCTTCT TTCAGAATAC GATTTCCCAG GTGATGAAAT 200

CCCAGTTATC CAAGGTTCAG CTCTTAAAGC TCTTGAAGGT GATGAAAAAT 250 

ATGAAGACAT CATCATGGAA TTGATGGATA CTGTTGATGA ATACATTCCA 300 

GAACCAGAAC GTGACACTGA CAAACCACTT CTTCTTCCAG TCGAAGATGT 350

ATTCTCAATC ACTGGACGTG GTACTGTTGC TTCAGGACGT ATCGACCGTG 400

GTACTGTTAA AGTCAATGAT GAAGTTGAAA TTGTTGGTAT TCGTGACGAA 450

ATCCAAAAAG CAGTTGTTAC TGGTGTTGAA ATGTTCCGTA AACAATTGGA 500 

CGAAGGTCTT GCTGGAGATA ACGTAGGGGT TCTTCTTCGT GGTATCCAAC 550 

GTGACGAAAT CGAACGTGGA CAAGTTCTTG CTAAACCAGG TTCAATTCAT 600 

CCACACACTA AATTCAAAGG TGAAGTTTAC ATCCTTACTA AAGAAGAAGG 650 

TGGACGTCAT ACTCCATTCT TCAACAACTA CCGTCCTCAA TTCTACTTCC 700 

GTACTACAGA CGTTACAGGT TCAATCGAAC TTCCTGCAGG TACTGAAATG 750 

GTAATGCCTG GTGATAACGT AACAATTGAT GTTGAGTTGA TCCACCCAAT 800 

TGCCGTAGAA CAAGGAACTA C 821 



2) INFORMATION FOR SEQ ID NO: 214 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus cricetus 

(B) STRAIN: ATCC 19642

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214

GCTATCCTTG TAGTAGCTTC TACAGACGGA CCAATGCCAC AAACTCGTGA 50

ACACATCTTG CTTTCACGCC AAGTTGGTGT TAAGAGCCTT ATCGTCTTCA 100 

TGAACAAGGT TGACTTGGTT GACGATGAAG AATTGCTTGA ATTGGTTGAA 150

ATGGAAATCC GTGATCTTCT TTCAGAATAC GATTTCCCAG GTGATGATAT 200 

CCCTGTTGTT CAAGGTTCAG CTCTTAAAGC CCTTGAAGGT GATACAGCTG 250 

CCGAAGACAA GATCATGGAA TTGATGGACA TCGTTGATGA CTACATTCCA 300 

GAACCAAAAC GTGATACTGA TAAGCCATTG CTTCTTCCAG TCGAAGACGT 350 

ATTCTCAATC ACTGGACGTG GTACTGTTGC TTCAGGACGT ATCGACCGTG 400

GTACTGTTAA GGTCAATGAC GAAGTTGAAA TCGTTGGTAT CAAGGACGAA 450 

ATCCAAAAAG CGGTTGTTAC CGGAGTTGAA ATGTTCCGTA AACAATTGGA 500 

TGAAGGTCTT GCAGGGGATA ACGTTGGTGT GCTTCTTCGT GGTATCCAAC 550 

GTGATGAAAT CGAACGTGGT CAAGTATTGG CTGCACCTGG TTCAATCCAT 600 

CCACACACTA AATTCAAGGG TGAAGTTTAC ATCCTTTCTA AAGATGAAGG 650

TGGACGTCAC ACTCCATTCT TCAACAACTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTAACTGGT TCAATCGAAT TGCCAGCAGG TACTGAAATG 750 

GTTATGCCTG GTGATAACGT TACTATCGAC GTTGAATTGA TCCACCCAAT 800 

CGCTGTTGAA AAAGGTACTA C 821 




2) INFORMATION FOR SEQ ID NO: 215 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 821 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus cristatus

(B) STRAIN: ATCC 51100

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215 

TATCCTTGTA GTAGCTTCAA CTGACGGACC AATGCCACAA ACTCGTGAGC 50 

ACATCCTTCT TTCACGTCAG GTTGGTGTTA AACACCTTAT CGTCTTCATG 100

AACAAGATCG ACTTGGTTGA TGACGAAGAA TTGCTTGAAT TGGTTGAAAT 150 

GGAAATCCGT GACCTCTTGT CAGAATACGA CTTCCCAGGT GACGATCTTC 200

CAGTTATCCA AGGTTCAGCT CTTAAAGCTC TTGAAGGTGA TACTAAGTAC 250 

GAAGACATCA TCATGGAATT GATGAACACT GTTGATGAGT ACATCCCAGA 300 

ACCAGAACGT GATACTGACA AACCTCTTCT TCTTCCAGTC GAAGACGTAT 350

TCTCAATCAC TGGTCGTGGT ACAGTTGCTT CAGGACGTAT CGACCGTGGT 400 

ACTGTTCGTG TCAACGATGA AATCGAAATC GTTGGTATCA AAGAAGAAAT 450 

CCAAAAAGCA GTTGTTACTG GTGTTGAAAT GTTCCGTAAA CAGCTTGACG 500 

AAGGTCTTGC AGGGGACAAC GTAGGTGTAC TTCTTCGTGG TATCCAACGT 550 

GATGAAATCG AACGTGGTCA AGTTATCGCT AAACCAGGTT CAATCAACCC 600

ACACACTAAA TTCAAGGGTG AAGTTTACAT CCTTACTAAA GAAGAAGGTG 650 

GACGTCACAC TCCATTCTTC AACAACTACC GTCCACAGTT CTACTTCCGT 700 

ACAACTGACG TTACAGGTTC AATCGAACTT CCAGCAGGTA CTGAAATGGT 750 

AATGCCTGGT GATAACGTAA CTATCGACGT TGAGTTGATC CACCCAATCG 800 

CCGTTGAACA AGGTACTCCT T 821



2) INFORMATION FOR SEQ ID NO: 216

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 792 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus downei

(B) STRAIN: ATCC 33748 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216 



PCT/CA00/01150



AGTAGCTTCT ACTGATGGAC CAATGCCACA AACTCGTGAA CACATCTTGC 50

TTTCACGTCA GGTTGGTGTT AAGAACCTTA TCGTCTTCAT GAACAAGGTT 100

GACTTGGTTG ACGATGAAGA ATTGCTTGAA TTGGTTGAAA TGGAAATCCG 150

TGACCTGCTT TCAGAATACG ATTTCCCAGG TGATGATATC CCTGTTGTTC 200

AAGGTTCAGC TCTTAAGGCT CTTGAAGGTG ATACAGCTGC CGAAGACAAG 250

ATCATGGAAT TGATGGACAT CGTTGATGAC TACATTCCAG AACCAAAACG 300

TGATACTGAT AAGCCTTTGC TTCTTCCAGT CGAAGATGTA TTCTCAATCA 350

CTGGACGTGG TACTGTAGCT TCAGGACGTA TCGACCGTGG TACTGTTAAG 400

GTCAACGACG AAGTTGAAAT CGTTGGTATC AAGGACGAAA TCCAAAAAGC 450

AGTTGTTACC GGAGTTGAAA TGTTCCGTAA ACAATTGGAC GAAGGTCTTG 500

CAGGGGATAA CGTTGGTGTG CTTCTTCGTG GTATCCAACG TGATGAAATC 550

GAACGTGGTC AAGTGTTGGC TGCGCCTGGT TCGATTCACC CACACACTAA 600

GTTTAAAGGT GAAGTTTACA TCCTTTCTAA AGAAGAAGGT GGACGTCATA 650

CTCCATTCTT TAACAACTAC CGTCCACAGT TCTACTTCCG TACAACTGAC 700

GTAACTGGTT CAATCGAATT GCCAGCGGGT ACTGAAATGG TTATGCCTGG 750

TGATAACGTT ACTATCGACG TTGAATTGAT CCACCCAATT GC 792



2) INFORMATION FOR SEQ ID NO: 217

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 795 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus dysgalactiae

(B) STRAIN: ATCC 43078 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217 

GTAGTTGCTT CAACAGACGG ACCAATGCCA CAAACTCGTG AGCACATCCT 50

CCTTTCACGT CAGGTTGGTG TTAAACACCT TATCGTGTTC ATGAACAAAA 100 

TTGACCTTGT TGACGATGAA GAATTGCTTG AATTGGTTGA AATGGAAATC 150 

CGTGACCTTC TTTCAGAATA CGATTTCCCA GGTGATGACC TTCCAGTTAT 200 

CCAAGGTTCA GCTCTTAAAG CTCTTGAAGG CGACACTAAA TTTGAAGACA 250 

TCATCATGGA ATTGATGGAT ACTGTTGATT CATACATTCC AGAACCAGAA 300

CGTGACACTG ACAAACCATT GCTTCTTCCA GTCGAAGACG TATTCTCAAT 350 

CACAGGTCGT GGTACAGTTG CTTCAGGACG TATCGACCGT GGTACTGTTC 400 

GTGTCAACGA CGAAATCGAA ATCGTTGGTA TCAAAGAAGA AACTAAAAAA 450

GCTGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG ACGAAGGTCT 500

TGCAGGAGAC AACGTAGGTA TCCTTCTTCG TGGTGTTCAA CGTGACGAAA 550

TCGAACGTGG TCAAGTTATT GCTAAACCAG GTTCAATCAA CCCACACACT 600

AAATTCAAAG GTGAAGTATA TATCCTTTCT AAAGACGAAG GTGGACGTCA 650

CACTCCATTC TTCAACAACT ATCGTCCACA ATTCTACTTC CGTACAACTG 700

ACGTAACAGG TTCAATCGAA CTTCCAGCTG GTACAGAAAT GGTTATGCCT 750

GGTGATAACG TGACAATCAA CGTTGAGTTG ATCCACCCAA TCGCC 795



2) INFORMATION FOR SEQ ID NO: 218

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus equi subsp. equi

(B) STRAIN: ATCC 9528 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218 

CGGAGCTATC CTTGTAGTTG CTTCAACTGA CGGACCAATG CCACAAACTC 50 

GTGAGCACAT CCTTCTTTCA CGTCAGGTTG GTGTTAAGCA CCTTATCGTG 100 

TTCATGAACA AGGTTGACCT TGTTGACGAT GAAGAATTGC TTGAGCTTGT 150 

TGAAATGGAA ATTCGTGACC TTCTTTCAGA ATATGATTTC CCAGGTGATG 200 

ACCTTCCAGT TATCCAAGGT TCAGCGCTTA AGGCTCTTGA AGGCGACAGC 250 

AAATACGAAG ATATCATCAT GGAATTGATG GATACTGTTG ATTCATACAT 300 

TCCAGAACCA GAACGTGACA CAGACAAGCC ATTGCTTCTT CCAGTCGAGG 350 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTTCAGG ACGTATCGAC 400

CGCGGTACTG TTCGTGTTAA CGACGAAATC GAAATCGTTG GTATCAGAGA 450 

CGAGATCAAA AAAGCAGTTG TTACTGGTGT CGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTTCTTCT TCGTGGTGTA 550 

CAACGTGATG AAATCGAACG TGGTCAAGTT ATTGCTAAGC CAGGTTCTAT 600 

CAACCCACAC ACTAAATTTA AAGGTGAAGT ATATATCCTT ACTAAAGAAG 650 

AAGGTGGACG TCACACACCA TTCTTCAACA ACTATCGTCC ACAATTCTAC 700 

TTCCGTACTA CTGACGTAAC AGGTTCAATC GAGCTTCCAG CAGGTACAGA 750 

AATGGTTATG CCTGGTGATA ACGTGACTAT TGACGTTGAG TTGATCCACC 800 

CAATCGCCGT AGAACAAGGT ACTACATT 828



2) INFORMATION FOR SEQ ID NO: 219 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus ferus 

(B) STRAIN: ATCC 33477 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219

CGGTGCAATC CTTGTAGTAG CTTCTACAGA TGGACCAATG CCACAAACTC 50

GTGAGCACAT CCTTCTTTCA CGTCAGGTAG GTGTTAAACA CCTTATCGTC 100

TTCATGAACA AAGTTGACTT GGTTGACGAT GAAGAATTGC TTGAATTGGT 150

TGAAATGGAA ATCCGTGACC TGCTTTCAGA ATATGATTTC CCAGGTGATG 200

ACCTTCCAGT TATCCAAGGT TCAGCTCTTA AAGCGCTTGA AGGTGATACT 250

GCTCAAGAAG ATGTTATCAT GGAATTGATG AAAACCGTTG ATGAGTACAT 300

CCCAGAACCA GAACGTGATA CTGACAAACC ATTGCTTCTT CCAGTCGAAG 350

ATGTATTCTC AATCACAGGT CGTGGTACTG TAGCTTCAGG ACGTATCGAT 400

CGTGGTACTG TAAGAGTCAA CGATGAAGTT GAAATCGTTG GTATCAAAGA 450

CGAAATCACT AAAGCAGTTG TTACCGGTGT TGAAATGTTC CGTAAACAAT 500

TGGACGAAGG TCTTGCTGGT GATAACGTTG GTGTGCTTCT CCGTGGTGTG 550

CAACGTGATG AAATCGAACG TGGTCAAGTA TTGGCTAAAC CAGGTTCAAT 600

CAACCCACAC ACTAAATTTA AAGGTGAAGT TTACATCCTT ACTAAAGAAG 650

AAGGTGGACG TCATACACCA TTCTTCAACA ACTACCGTCC ACAGTTCTAC 700

TTCCGTACAA CTGACGTAAC TGGTTCAATC GAATTGCCAG CAGGTACTGA 750

AATGGTTATG CCTGGTGATA ACGTGACTAT CGACGTTGAA TTGATCCACC 800

CAATCGCCGT TGAACAAGGT ACTAC 825



2) INFORMATION FOR SEQ ID NO: 220

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 826 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus gordonii

(B) STRAIN: ATCC 10558

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220

CGGAGCTATC CTTGTAGTAG CTTCAACTGA TGGTCCTATG CCACAAACTC 50 

GTGAGCACAT CCTTCTCTCA CGCCAAGTTG GTGTTAAACA CTTGATCGTG 100 

TTCATGAACA AAGTTGACTT GGTTGACGAT GAAGAATTGC TTGAGTTGGT 150 

TGAAATGGAA ATCCGTGACC TCTTGTCAGA ATACGACTTC CCAGGTGACG 200

ATCTTCCAGT TATCCAAGGT TCAGCTCTTA AAGCTCTTGA AGGTGACTCT 250 

AAATATGAAG ATATCATCAT GGAATTGATG AACACTGTTG ATGAGTACAT 300 

CCCAGAACCA GAACGCGACA CTGACAAACC ATTGCTTCTT CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGA CGTGGTACAG TTGCTTCAGG ACGTATCGAC 400

CGTGGTATCG TTAAAGTCAA TGACGAAATC GAAATCGTTG GTATCAAAGA 450

AGAAATCCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTGCTTCT TCGTGGTATC 550 

CAACGTGATG AAATCGAACG TGGACAAGTT ATTGCTAAAC CAGGTTCAAT 600 

CAACCCACAC ACTAAATTCA AAGGTGAAGT TTATATCCTT ACTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCTTCAACA ACTACCGTCC ACAGTTCTAC 700

TTCCGTACAA CTGACGTTAC AGGTTCAATC GAACTTCCAG CAGGTACTGA 750 

AATGGTAATG CCTGGTGATA ACGTAACTAT CGACGTTGAG TTGATCCACC 800 

CAATCGCCGT TGAACAAGGT ACTACT 826 



2) INFORMATION FOR SEQ ID NO: 221 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 799 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus anginosus

(B) STRAIN: ATCC 27335

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221

TGTAGTAGCT TCAACTGACG GACCAATGCC TCAAACTCGT GAACATATCC 50 

TTCTTTCACG TCAAGTAGGT GTTAAATACC TTATTGTCTT CATGAACAAA 100 

GTTGACTTGG TTGACGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150

CCGTGATCTT CTTTCAGAAT ACGATTTCCC AGGTGATGAT ATTCCAGTAA 200 

TCCAAGGTTC AGCACTTAAA GCTCTTGAAG GTGATGAAAA ATATGAAGAC 250 

ATCATCATGG AATTGATGAA TACTGTTGAT GAATATATTC CAGAACCAGA 300 

ACGTGATACT GACAAACCAT TGCTTCTTCC AGTCGAAGAT GTATTCTCAA 350 

TCACTGGACG TGGTACTGTT GCTTCAGGAC GTATCGACCG TGGTACTGTT 400

AAAGTCAACG ATGAAGTTGA AATCGTTGGT ATCCGCGAGG AAATCCAAAA 450

AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAATTG GACGAAGGTC 500 

TTGCTGGAGA TAACGTAGGG GTTCTTCTTC GTGGTATCCA ACGTGACGAA 550 

ATTGAACGTG GACAAGTTCT TGCTAAACCA GGTTCAATTC ATCCACACAC 600 

TAAATTCAAA GGTGAAGTTT ACATCCTTAC TAAAGAAGAA GGTGGACGTC 650

ATACTCCATT CTTCAACAAC TACCGTCCTC AATTCTACTT CCGTACTACA 700 

GACGTTACAG GTTCAATCGA ACTTCCTGCA GGTACTGAAA TGGTAATGCC 750 

TGGTGATAAC GTAACAATTG ATGTTGAGTT GATCCACCCA ATTGCCGTA 799 


2) INFORMATION FOR SEQ ID NO: 222 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus macacae

(B) STRAIN: ATCC 35911

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222

TGGTGCTATT CTTGTAGTAG CTTCAACTGA CGGTCCAATG CCTCAAACGC 50 

GTGAACATAT CCTTCTTTCA CGCCAAGTAG GTGTTAAAAA CCTTATTGTT 100 

TTCATGAATA AAGTTGACTT AGTTGATGAT GAAGAATTGC TTGAATTGGT 150

TGAAATGGAA ATCCGTGATC TTCTTACAGA ATATGATTTC CCAGGCGATG 200 

AACTTCCAGT TATCCAAGGT TCAGCACTTA AAGCTCTTGA AGGTGATACT 250 

AAGTACGAAG ATATTATCAT GGAATTGTTG GATACTGTAG ATGATTACAT 300

CCCAGAACCA CAACGTGATA CTGACAAGCC ATTGCTTCTT CCAGTCGAAG 350 

ATGTTTTCTC TATTACTGGA CGTGGTACTG TTGCTTCAGG ACGTATTGAC 400

CGTGGTACTG TTAAGGTTAA TGATGAAGTT GAAATCGTTG GTATTCGTGA 450 

CGATATTCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCTGGT GATAACGTCG GTGTCCTTCT TCGTGGTATC 550 

CAACGTGATG AAATTGAACG CGGTCAAGTT CTTGCTAAAC CAGGATCAAT 600 

TCATCCACAT ACTAAATTCA AAGGTGAAGT TTATATTCTT ACTAAAGAAG 650

AAGGTGGACG TCATACTCCA TTCTTTAACA ACTACCGTCC ACAGTTCTAC 700

TTCCGTACAA CTGATGTAAC TGGTTCAATT GATTTGCCAG CAGGTACTGA 750

AATGGTTATG CCTGGTGATA ATGTTACGAT TGATGTTGAA CTGATCCACC 800

CAATCGCTGT TGAACAAGGT ACAAC 825



2) INFORMATION FOR SEQ ID NO: 223 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 822 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus gordonii (deposited as Streptococcus mitis)

(B) STRAIN: ATCC 33399

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223

CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GACTTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTATTG TCAGAATACG ACTTCCCAGG TGACGATCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCC CTTGAAGGTG ACACTAAATA 250 

CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TACATCCCAG 300

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAAGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400

TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 450

CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

GAAGGTCTTG CCGGAGATAA TGTAGGTGTC CTTCTTCGTG GTGTTCAACG 550

TGATGAAATC GAACGTGGAC AAGTTATTGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAAGGT GAAGTTTACA TCCTTACTAA AGAAGAAGGT 650 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

TACTACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

TAATGCCTGG TGATAACGTG ACAATCGACG TTGAGTTGAT CCACCCAATC 800

GCCGTAGAAC AAGGTACTAC AT 822 



2) INFORMATION FOR SEQ ID NO: 224

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 827 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus mutans

(B) STRAIN: ATCC 25175

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224

CCCTGGTGCT ATCCTTGTAG TAGCTTCAAC TGATGGACCA ATGCCACAAA 50

CTCGTGAACA CATTCTTCTT TCACGTCAAG TTGGTGTTAA ATACCTCATT 100

GTCTTCATGA ATAAAGTTGA TTTGGTTGAC GATGAAGAAT TGCTTGAATT 150

GGTTGAAATG GAAATCCGTG ATCTTCTTTC AGAATATGAT TTCCCAGGTG 200

ATGATATTCC AGTTATTCAA GGTTCAGCTC TTAAAGCTCT TGAAGGCGAT 250

ACTGCTCAAG AAGATATCAT CATGGAATTA ATGCATACTG TTGATGACTA 300

CATTCCAGAT CCAGAACGTG ATACTGACAA GCCGCTCCTT CTTCCAGTCG 350

AAGATGTTTT CTCAATCACT GGTCGTGGTA CTGTTGCTTC AGGACGTATT 400

GATCGTGGTA CTGTTAAAGT TAACGATGAA GTTGAAATCG TTGGTATCCG 450

TGATGACATT CAAAAAGCTG TTGTTACTGG TGTTGAAATG TTCCGTAAAC 500

AATTGGATGA AGGTATTGCA GGGGATAATG TTGGTGTTCT CCTTCGTGGT 550

ATCCAACGTG ATGAAATCGA ACGTGGTCAA GTTCTTGCTA AACCAGGTTC 600

AATTCACCCA CATACTAAAT TCAAAGGTGA AGTTTATATC CTTACTAAAG 650

AGGAAGGTGG ACGTCATACA CCATTCTTCA ATAACTATCG TCCACAATTC 700

TACTTCCGTA CAACTGACGT AACTGGTTCA ATTGAGTTGC CAGCAGGTAC 750

TGAAATGGTT ATGCCTGGTG ATAACGTTAC TATTGACGTT GAATTGATCC 800

ATCCAATCGC TGTTGAACAA GGTACTA 827



2) INFORMATION FOR SEQ ID NO: 225 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus parasanguinis 

(B) STRAIN: ATCC 15912 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225 

AGCTATCCTT GTAGTAGCTT CAACTGACGG ACCAATGCCA CAAACACGTG 50 

AACACATCCT TCTTTCACGT CAGGTTGGTG TTAAACACTT GATCGTCTTC 100 

ATGAACAAAG TTGACTTGGT TGATGATGAA GAATTGCTTG AATTGGTTGA 150 

AATGGAAATC CGTGACCTTC TTTCAGAATA CGATTTCCCA GGTGATGACC 200 

TTCCAGTTAT CCAAGGTTCA GCTCTTAAAG CTCTTGAAGG TGACTCTAAA 250 

TATGAAGATA TCATCATGGA ATTGATGGAT ACTGTTGATG AGTACATCCC 300 

AGAACCAGAA CGCGATACTG ACAAACCATT GCTTCTTCCA GTCGAAGACG 350 

TATTCTCAAT CACTGGACGT GGTACAGTTG CTTCAGGACG TATCGACCGT 400 

GGTGTTGTTC GTGTCAATGA TGAAATCGAA ATCGTTGGTA TCAAAGAAGA 450 

AATCCAAAAA GCAGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG 500 

ACGAAGGTCT TGCAGGGGAT AACGTTGGTG TGCTTCTTCG TGGTATCCAA 550 

CGTGATGAAA TCGAACGTGG ACAAGTTATC GCTAAACCAG GTTCAATCAA 600 

CCCACACACT AAATTCAAAG GTGAAGTTTA CATCCTTACT AAAGAAGAAG 650 

GTGGACGTCA TACTCCATTC TTCAACAACT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACTGG ATCTATCGAA CTTCCACCAG GAACTGAAAT 750 

GGTAATGCCT GGTGATAACG TGACTATCGA CGTTGAGTTG ATCCACCCAA 800 

TCGCCGTTGA ACAAGGTACT ACAT 824 



2) INFORMATION FOR SEQ ID NO: 226 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 824 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus ratti

(B) STRAIN: ATCC 19645

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226 

TGGTGCTATC CTTGTAGTAG CTTCAACTGA TGGACCAATG CCGCAAACTC 50 

GTGAACACAT CTTGCTTTCA CGTCAAGTTG GTGTTAAATA CCTTATCGTC 100 

TTCATGAACA AGGTTGACTT GGTTGATGAT GAAGAATTGC TTGAATTGGT 150

TGAAATGGAA ATCCGTGATC TTCTTTCAGA ATACGATTTC CCAGGTGATG 200 

ACATTCCAGT TATCCAAGGT TCAGCCCTTA AAGCTCTTGA AGGTGACACT 250 

GAACAAGAAG ATGTTATCAT GGAATTGATG AAAACAGTTG ATGAGTACAT 300

CCCAGATCCA GAACGCGATA CTGATAAGCC ATTGCTTCTT CCAGTCGAAG 350 

ACGTGTTCTC AATCACTGGA CGTGGTACTG TTGCATCAGG ACGTATCGAC 400

CGTGGTACTG TTAAAGTCAA TGACGAAGTT GAAATCGTTG GTATCCGTGA 450 

TGACATCCAA AAAGCTGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCTGGT GATAACGTTG GTGTACTTCT TCGTGGTATC 550 

CAACGTGATG AAATCGAACG CGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600 

TCATCCGCAT ACTAAATTTA AAGGTGAAGT TTACATCCTT ACTAAAGAAG 650

AAGGCGGACG TCACACTCCA TTCTTCAACA ACTACCGTCC ACAGTTCTAC 700

TTCCGTACAA CTGACGTAAC TGGTTCAATC GAATTGCCAG CAGGTACTGA 750 

AATGGTTATG CCTGGTGATA ACGTGACTAT CGACGTTGAA TTGATCCACC 800 

CAATCGCTGT TGAACAAGGT ACTA 824 




2) INFORMATION FOR SEQ ID NO: 227 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 795 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus sanguinis

(B) STRAIN: ATCC 10556

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227 

TGTAGTAGCT TCAACTGACG GACCAATGCC ACAAACTCGT GAGCACATCT 50 

TGCTTTCACG TCAGGTTGGT GTTAAACACT TGATCGTCTT CATGAACAAA 100

GTTGACTTGG TTGACGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150 

CCGTGACCTC TTGTCAGAAT ACGACTTCCC AGGTGACGAT CTTCCAGTTA 200 

TCCAAGGTTC AGCTCTTAAA GCTCTTGAAG GTGACTCTAA ATATGAAGAC 250 

ATCATCATGG AATTGATGGA CACTGTTGAT GAGTACATCC CAGAACCAGA 300 

ACGCGATACT GACAAGCCAT TGCTTCTTCC AGTCGAAGAC GTATTCTCAA 350

TCACTGGTCG TGGTACAGTT GCTTCAGGAC GTATCGACCG TGGTATCGTT 400 

AAAGTCAACG ACGAAATCGA AATCGTTGGT ATCAAAGAAG AAATCCAAAA 450 

AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAGCTT GACGAAGGTC 500 

TTGCAGGGGA CAACGTAGGT GTGCTTCTCC GTGGTATCCA ACGTGATGAA 550 

ATCGAACGTG GACAAGTTAT CGCTAAACCA GGTTCAATCA ACCCACACAC 600




WO 01/23604 






TAAATTCAAG GGTGAAGTTT ATATCCTTAC TAAAGAAGAA GGCGGACGTC 650

ACACTCCATT CTTCAACAAC TACCGTCCAC AGTTCTACTT CCGTACAACT 700

GACGTTACAG GTTCAATCGA ACTTCCAGCA GGTACTGAAA TGGTAATGCC 750

TGGTGATAAC GTAACAATCG ACGTTGAGTT GATCCACCCA ATCGC 795



2) INFORMATION FOR SEQ ID NO: 228 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 795 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus sobrinus

(B) STRAIN: ATCC 33478

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228 



TGTAGTAGCT TCTACTGACG GACCAATGCC ACAAACTCGT GAACACATCT 50 

TGCTTTCACG CCAAGTTGGT GTTAAGAACC TCATCGTCTT CATGAACAAG 100

GTTGACTTGG TTGATGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150 

CCGTGATCTT CTTTCAGAAT ACGATTTCCC AGGTGACGAC ATTCCTGTTG 200 

TTCAAGGTTC AGCTCTTAAG GCTCTTGAAG GTGATACAGC TGCCGAAGAC 250 

AAGATTATGG AATTGATGGA CATCGTTGAT GATTACATTC CAGAACCAAA 300 

ACGCGATACT GATAAGCCAT TGCTTCTCCC AGTCGAAGAC GTATTCTCAA 350

TCACTGGTCG TGGTACTGTT GCTTCAGGAC GTATTGACCG TGGTACTGTT 400 

AAGGTTAACG ACGAAGTTGA AATCGTTGGT ATCCGTGACG ATATCCAAAA 450 

AGCAGTTGTT ACTGGAGTTG AAATGTTCCG TAAGCAATTG GACGAAGGTC 500 

TTGCTGGAGA TAACGTTGGT GTGCTTCTTC GTGGTATCCA ACGTGATGAA 550 

ATTGAACGTG GTCAAGTATT GGCTGCACCT GGTTCAATCC ACCCACACAC 600

TAAGTTCAAG GGTGAAGTTT ACATCCTTTC TAAAGATGAA GGTGGACGTC 650 

ACACTCCATT CTTCAACAAC TACCGTCCAC AGTTCTACTT CCGTACAACT 700 

GACGTAACTG GTTCAATCGA ATTGCCAGCA GGTACTGAAA TGGTTATGCC 750 

TGGTGATAAC GTTACTATCG ACGTTGAATT GATCCACCCA ATCGC 795 




2) INFORMATION FOR SEQ ID NO: 229 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 797 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus suis

(B) STRAIN: ATCC 43765

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229 

TGTAGTAGCT TCAACTGACG GTCCAATGCC ACAAACTCGT GAGCACATCC 50

TTCTTTCACG TCAGGTTGGT GTTAAACACC TTATCGTCTT CATGAACAAA 100

GTTGACTTGG TTGACGATGA AGAATTGCTT GAGTTGGTTG AAATGGAAAT 150

CCGTGACCTT CTTTCAGAAT ACGATTTCCC AGGTGATGAT CTTCCAGTTA 200

TCCAAGGTTC AGCTCTTAAA GCTCTTGAAG GTGACTCTAA GTACGAAGAC 250

ATCGTTATGG AATTGATGAA CACTGTTGAT GAGTACATTC CAGAACCAGA 300

ACGCGACACT GACAAACCAT TGTTGCTTCC AGTCGAGGAC GTATTCTCAA 350

TCACTGGTCG TGGTACTGTA GCTTCAGGAC GTATCGACCG TGGTACTGTT 400

CGTGTCAACG ACGAAATCGA AATCGTTGGT CTTCAAGAAG AAAAATCTAA 450

AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAACTT GACGAAGGTC 500

TTGCCGGCGA TAACGTTGGT GTGCTTCTTC GTGGTGTACA ACGTGATGAA 550

ATCGAACGTG GTCAAGTTAT CTCTAAACCA GGTTCTATCA ACCCACACAC 600

TAAATTCAAA GGTGAAGTTT ACATCCTTAC TAAAGAAGAA GGTGGACGTC 650

ACACTCCATT CTTCGACAAC TACCGTCCAC AGTTCTACTT CCGTACAACT 700

GACGTAACTG GTTCAATCAA ATTGCCAGAA GGTACTGAAA TGGTAATGCC 750

TGGTGATAAC GTTACTATCG ACGTTGAATT GATCCACCCA ATCGCCG 797



2) INFORMATION FOR SEQ ID NO: 230 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 793 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus uberis

(B) STRAIN: ATCC 19436

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230 

TTGTTGTTGC ATCAACTGAT GGACCAATGC CACAAACTCG TGAGCACATC 50 

CTTCTTTCAC GCCAAGTTGG TGTTAAACAC CTTATCGTTT TCATGAACAA 100 

AATCGACCTT GTTGACGATG AAGAATTGCT TGAATTAGTT GAAATGGAAA 150 

TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA CCTACCAGTT 200 

ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATTCTA AATACGAAGA 250 

CATCATCATG GAATTGATGA AAACTGTTGA TGAGTATATT CCAGAACCAG 300 

AACGTGATAC AGACAAACCA TTACTTCTTC CAGTCGAAGA CGTATTCTCA 350 

ATCACAGGTC GTGGTACTGT AGCTTCAGGA CGTATCGATC GTGGTACTGT 400

TCGTGTCAAC GACGAAATTG AAATCGTTGG TATCAAAGAA GAAACTAAAA 450 

AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

CTTGCAGGAG ATAACGTAGG TATCCTTCTT CGTGGTGTTC AACGTGACGA 550 

AATCGAACGT GGACAAGTTA TTGCTAAACC AGGTTCAATC AACCCACACA 600 

CTAAATTCAA AGGTGAAGTT TACATCCTTT CTAAAGATGA AGGTGGACGT 650 

CATACTCCAT TCTTCAACAA CTACCGTCCT CAATTCTATT TCCGTACAAC 700 

TGACGTAACA GGTTCAATCG AACTTCCAGC TGGTACTGAA ATGGTAATGC 750 

CTGGTGATAA CGTGACAATC AGCGTTGAGT TGATCCACCC AAT 793 






2) INFORMATION FOR SEQ ID NO: 231 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 798 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus vestibularis

(B) STRAIN: ATCC 49124

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231 

TTGTAGTAGC ATCTACTGAC GGACCAATGC CACAAACTCG TGAGCACATC 50 

CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 100

AGTTGACTTG GTTGACGATG AAGAATTGCT TGAATTGGTT GAAATGGAAA 150 

TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA TATTCCAGTT 200 

ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATTCTA AATACGAAGA 250 

CATCATCATG GACTTGATGA ACACTGTTGA CGAATACATT CCAGAACCAG 300 

AACGTGACAC TGACAAACCA TTGTTGCTTC CAGTCGAAGA CGTATTCTCA 350

ATCACTGGTC GTGGTACTGT TGCTTCAGGA CGTATCGACC GTGGTGTTGT 400

TCGTGTTAAT GACGAAGTTG AAATCGTTGG TCTTAAAGAA GAAATCCAAA 450 

AAGCAGTTGT TACTGGTGTA GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

ATTGCCGGAG ATAACGTCGG TGTCCTTCTT CGTGGTATCC AACGTGATGA 550 

AATTGAACGT GGTCAAGTAT TGGCTGCACC TGGTTCAATC AACCCACACA 600

CTAAATTCAA AGGTGAAGTT TACATCCTTT CTAAAGAAGA AGGTGGACGT 650 

CACACTCCAT TCTTCAACAA CTACCGTCCA CAGTTCTACT TCCGTACAAC 700 

TGACGTAACA GGTTCAATCG AACTTCCTGC AGGTACTGAA ATGGTTATGC 750 

CTGGTGATAA CGTGACTATC GACGTTGAGT TGATCCACCC AATCGCCG 798 




2) INFORMATION FOR SEQ ID NO: 232 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tatumella ptyseos 
(B) STRAIN: ATCC 33301

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232 

GGCGCTATCC TGGTTGTTGC TGCAACTGAC GGCCCTATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGCC GCCAGGTAGG CGTTCCTTAC ATCATCGTGT 100

TCCTGAACAA ATGTGACATG GTTGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TCCGTGACCT GCTGTCACAG TACGACTTCC CGGGTGACGA 200 

CACGCCAATC GTTCGCGGTT CAGCGCTGAA AGCACTGGAA GGTGAAGGCG 250 

AGTGGGAAGA GAAGATTCTG GAGCTGGCTG GCTTCCTGGA TTCTTACATC 300 

CCTGAGCCAG AGCGTGCTAT CGATCAGCCG TTCCTGCTGC CAATCGAAGA 350

CGTATTCTCA ATCTCCGGTC GTGGTACAGT TGTTACCGGT CGTGTAGAGC 400 

GCGGGATCAT CAAAGTCGGT GAAGAAGTTG AGATCGTTGG TATCAAAGAT 450 

ACTGCGAAAT CAACCTGTAC CGGTGTTGAA ATGTTCCGTA AACTGCTGGA 500 

CCAGGGTCAG GCGGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGAGAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCAATCAAA 600

CCACACACCC AGTTCGAGTC AGAAGTTTAT ATTCTGTCTA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGA ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCTG GTGACAACAT CAAAATGGTT GTTACCCTGA TCCATCCAAT 800 

CGCGATGGAC GATGGTCTGC GTTTCGCAA 829



2) INFORMATION FOR SEQ ID NO: 233

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Trabulsiella guamensis

(B) STRAIN: ATCC 49490 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233 

GGCGCAATCC TGGTAGTAGC AGCGACTGAC GGCCCGATGC CGCAGACTCG 50

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAGATGGAAG TTCGTGAACT GCTGTCTCAG TACGATTTCC CGGGCGATGA 200

CACGCCGATC GTACGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250

AGTGGGAAGC GAAAATCATC GAACTGGCAG GTTTCCTGGA TTCTTACATT 300

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCC ATCTCTGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAAGTGGGT GAAGAAGTAG AAATCGTTGG TATCAAAGAG 450 

ACTGCGAAGT CAACCTGTAC TGGCGTAGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAC 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800

CGCGATGGAC GACGGTCTGC GTTTCGCAA 829 



2) INFORMATION FOR SEQ ID NO: 234

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Veillonella parvula 

(B) STRAIN: ATCC 10790 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234 



CGGCGCTATC TTGGTTGTAT CCGCAGCTGA CGGCCCTATG CCTCAAACTC 50 

GCGAACACAT CTTGTTGGCT CGCCAAGTTG GTGTTCCTGC AATCGTAGTA 100 

TTCTTGAACA AAGCTGACAT GGTTGACGAT GAAGAATTGA TCGAATTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTTTCTTC CTACGAATTC CCTGGCGACG 200

AAGTACCTAT CGTTGTAGGT TCCGCGTTGA AAGCTTTGGA AGGCGATGCT 250

CAATATGTAG CTAAAATTGA CGAATTGATG GACGCTGTAG ACTCCTACAT 300

CCCAACACCA GTTCGTGACA CTGATAAACC ATTCTTGATG CCTGTGGAAG 350

ATGTTTTCAC AATCACTGGT CGTGGTACAG TAGCAACTGG CCGTGTTGAA 400

CGTGGTCAAG TAAACGTTGG TGATACTGTT GAAGTAGTAG GCTTGAAAGA 450

AAAAGCTGAA CAATACGTAG TAACAGGTCT TGAAATGTTC CGTAAAGTGT 500

TGGATTCTGC AGTAGCAGGT GACAACGTAG GTGCATTGCT TCGTGGTGTT 550

GATCGTAAAG ACATYGAACG TGGTCAAGTA TTGGCTAAAC CAGGTTCCAT 600

CAACCCACAY ACAAAATTCA AAGCAGAAGT ATACGTATTG ACTAAAGAAG 650

AAGGTGGTCG TCATACTCCA TTCTTCTCCA ACTACCGTCC ACAATTCTAC 700

TTCCGTACAA CAGACGTAAC AGGTGTTGTA AACCTTCCTG AAGGTGTAGA 750

AATGTGTATG CCTGGCGATA ACGTAACAAT GGAAATCGAA TTGATTACTC 800

CAATCGCTAT CGAAGAAGGT CTTCG 825



2) INFORMATION FOR SEQ ID NO: 235

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitica 

(B) STRAIN: ATCC 9610 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235 

CGCTATCCTG GTTGTTGCTG CAACTGATGG CCCAATGCCA CAGACTCGTG 50 

AGCACATCCT GTTGGGTCGT CAGGTTGGTG TTCCTTACAT CATCGTATTC 100 

ATGAACAAAT GTGACATGGT TGACGATGAA GAGCTGCTAG AACTGGTAGA 150 

AATGGAAGTG CGCGATCTTC TGTCTACCTA CGATTTCCCA GGCGATGATA 200 

CGCCAGTTGT TCGTGGTTCC GCGCTGAAAG CATTGGAAGG CGAACCTGAG 250 

TGGGAAGCAA AAATTATCGA ACTGGCTGGC TACCTGGATT CTTACATCCC 300 

AGAACCAGAG CGTGCTATCG ATAAGCCGTT CCTGCTGCCA ATCGAAGACG 350 

TATTCTCTAT CTCTGGTCGT GGTACTGTTG TAACGGGTCG TGTAGAGCGC 400

GGTATCGTTA AAGTTGGTGA AGAAGTCGAA ATTGTTGGCC TGAAAGATAC 450 

CGTTAAATCT ACTTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGATG 500 

AAGGCCGTGC AGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAGCGT 550 

GAAGATATCG AACGTGGTCA AGTTCTTGCT AAACCAGGTT CGATTAAACC 600 

ACACACCAAA TTTGAATCAG AAGTTTATAT TCTGAGCAAA GATGAAGGTG 650 

GTCGCCATAC TCCGTTCTTC AAAGGCTACC GTCCTCAGTT CTACTTCCGT 700 

ACAACTGATG TAACCGGTAC TATTGAACTG CCAGAAGGCG TTGAGATGGT 750 

GATGCCAGGT GATAACATTC AAATGATTGT TAACCTGATT GCTCCTATCG 800 

CAATGGATGA CGGCTTGCGC TTTGC 825 



2) INFORMATION FOR SEQ ID NO: 236 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Yersinia frederiksenii

(B) STRAIN: ATCC 33641

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236 

GGCGCGATCC TGGTTGTTGC TGCCACTGAT GGCCCGATGC CACAGACTCG 50 

CGAGCACATT CTGTTAGGGC GTCAGGTGGG TGTTCCTTAC ATCCTGGTCT 100 

TCCTGAACAA ATGTGACATG GTTGACGACG AAGAGCTGCT GGAACTGGTA 150

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CTGGCGACGA 200 

CACTCCAGTT ATCCGTGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCTG 250 

AGTGGGAAGC AAAAATCATC GAATTGGCTG AGGCGCTGGA TAGCTATATT 300 

CCACAGCCAG AGCGTGCGAT TGATAAACCA TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCA ATCTCTGGCC GTGGTACTGT TGTCACCGGT CGTGTAGAGC 400

GCGGTATCGT TAAAGTCGGC GAAGAAGTCG AAATCGTTGG TATCATTGAT 450 

ACCATCAAGA CTACCTGTAC TGGTGTTGAA ATGTTCCGCA AATTGCTGGA 500 

CGAAGGCCGT GCGGGTGAGA ACGTTGGTGT TCTGCTACGT GGTACTAAAC 550 

GTGATGACGT ACAACGTGGT CAGGTATTGG CAAAACCAGG TTCTATCAAG 600 

CCACACACCA AATTTGAATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 650

TGGTCGCCAT ACTCCGTTCT TCAAAGGTTA TCGTCCTCAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGT ACTATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTGATGCCAG GTGATAACAT TCAAATGATT GTTAACCTGA TTGCTCCTAT 800 

CGCAATGGAT GACGGTCTGC GCTTTGCG 828 




2) INFORMATION FOR SEQ ID NO: 237 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia intermedia 
(B) STRAIN: ATCC 29909

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237 

CTTGGTTGTC GCTGCAACTG ATGGTCCTAT GCCACAGACT CGCGAGCACA 50 

TCCTGCTAGG TCGTCAGGTG GGTGTTCCTT ACATCCTGGT CTTCCTGAAC 100

AAGTGTGACA TGGTTGACGA TGAGGAGTTG CTGGAATTGG TAGAAATGGA 150

AGTCCGCGAA CTTCTGTCTC AATATGATTT CCCTGGCGAT GATACTCCTG 200

TTATCCGTGG TTCAGCGCTG AAGGCGTTGG AAGGCGAGCC TGAATGGGAA 250 

GCAAAAATTA TCGAATTAGC TGAGGCGCTG GATAGTTATA TTCCACAGCC 300 

AGAGCGCGCG ATTGATAGAC CATTCTTGCT GCCAATCGAA GACGTATTCT 350

CTATCTCAGG TCGTGGTACA GTCGTCACTG GTCGTGTAGA GCGTGGGATC 400

GTTAAAGTTG GCGAAGAAGT TGAAATCGTT GGTATTATCG ATTCCATTAG 450

AACAACATGT ACTGGCGTTG AAATGTTCCG CAAATTGCTG GACGAAGGCC 500

GCGCGGGTGA GAACGTTGGT GTTCTACTGC GTGGGACTAA ACGTGATGAC 550

GTACAGCGTG GTCAGGTATT AGCTAAGCCA GGTTCTATCA AGCCACATAC 600

TAAATTCGAA TCCGAAGTTT ATATTCTGAG CAAAGATGAA GGCGGGCGTC 650 

ACACGCCGTT CTTCAAAGGC TACCGTCCTC AGTTCTACTT CCGTACAACG 700 

GATGTAACCG GTACTATTGA ATTGCCAGAC GGCGTTGAGA TGGTGATGCC 750 

AGGTGATAAC ATTCAAATGA TTGTTAACCT GATTGCACCT ATTGCGATGG 800 

ATGATGGTCT GCG 813



2) INFORMATION FOR SEQ ID NO: 238

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Yersinia pestis

(B) STRAIN: KIM D27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238 



GGAGCGATCT TGGTTGTTGC TGCAACCGAT GGCCCTATGC CGCAGACTCG 50

TGAGCATATC CTGCTGGGCC GCCAGGTTGG TGTCCCATAC ATTATTGTCT 100

TCCTGAACAA ATGTGACATG GTTGACGATG AAGAGTTGCT AGAGTTGGTT 150

GAAATGGAGG TTCGTGAGCT TCTGTCTCAA TACGATTTCC CAGGCGACGA 200

CACTCCAGTC ATCCGTGGTT CAGCGTTGAA AGCCCTGGAA GGTGACGCTG 250

AGTGGGAAGC TAAAATTATC GAGTTGGCAG AAGCTCTGGA TAGCTATATT 300

CCGCAACCAG AACGCGCTAT TGATAGACCA TTCCTATTGC CAATTGAAGA 350

CGTATTCTCT ATTTCTGGTC GTGGTACTGT AGTTACTGGT CGTGTAGAAC 400

GTGGTATTGT TAAGGTCGGC GAAGAAGTTG AAATCGTTGG TATTATCGAT 450

ACGATTAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGCA AGCTGCTGGA 500

TGAAGGCCGT GCTGGTGAAA ATGTTGGTGT TCTGCTGCGT GGTACTAAGC 550

GTGACGATGT TCAGCGTGGT CAAGTACTGG CGAAACCAGG TTCTATCAAG 600

CCACACACGA AGTTTGAGTC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 650

CGGCCGTCAT ACACCGTTCT TCAAGGGCTA CCGTCCTCAG TTCTACTTCC 700

GTACAACTGA CGTGACCGGT ACCATTGAGC TGCCAGAAGG CGTTGAAATG 750

GTCATGCCTG GTGACAACGT AAACATGGTT GTTAACCTAA TTGCTCCTAT 800

CGCAATGGAT GATGGTCTGC GCTTCGCAA 829



2) INFORMATION FOR SEQ ID NO: 239

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239

TGGAGCGATC TTGGTTGTTG CTGCAACCGA TGGCCCTATG CCGCAGACTC 50 

GTGAGCATAT CCTGCTGGGC CGCCAGGTTG GTGTCCCATA CATTATTGTC 100 

TTCCTGAACA AATGTGACAT GGTTGACGAT GAAGAGTTGC TAGAGTTGGT 150 

TGAAATGGAG GTTCGTGAGC TTCTGTCTCA ATACGATTTC CCAGGCGACG 200 

ACACTCCAGT CATCCGTGGT TCAGCGTTGA AAGCCCTGGA AGGTGACGCT 250

GAGTGGGAAG CTAAAATTAT CGAGTTGGCA GAAGCTCTGG ATAGCTATAT 300

TCCGCAACCA GAACGCGCTA TTGATAGACC ATTCCTATTG CCAATTGAAG 350

ACGTATTCTC TATTTCTGGT CGTGGTACTG TAGTTACTGG TCGTGTAGAA 400

CGCGGTATTG TTAAGGTCGG CGAAGAAGTT GAAATCGTTG GTATTATCGA 450

TACGATTAAA ACAACTTGTA CTGGCGTTGA AATGTTCCGC AAGCTGCTGG 500

ATGAAGGCCG TGCTGGTGAA AATGTTGGTG TTCTGCTGCG TGGTACTAAG 550

CGTGACGATG TTCAGCGTGG TCAAGTACTG GCGAAACCAG GTTCTATCAA 600

GCCACACACG AAGTTTGAGT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650

GCGGCCGTCA TACACCGTTC TTCAAGGGCT ACCGTCCTCA GTTCTACTTC 700

CGTACAACTG ACGTGACCGG TACCATTGAG CTGCCAGAAG GCGTTGAAAT 750

GGTCATGCCT GGTGACAACG TAAACATGGT TGTTAACCTA ATTGCTCCTA 800

TCGCAATGGA TGATGGT 817



2) INFORMATION FOR SEQ ID NO: 240

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia rohdei 

(B) STRAIN: ATCC 43380 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240 

TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCAATG CCACAGACTC 50 

GCGAGCACAT CCTGTTGGGT CGTCAAGTGG GTGTTCCTTA CATCTTAGTC 100 

TTCCTGAACA AGTGTGACAT GGTTGACGAC GAAGAGTTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAGC TTCTGTCTCA ATACGATTTC CCTGGCGATG 200 

ACACTCCGGT TATTCGTGGT TCCGCGCTGA AAGCGCTGGA AGGCGAGGCC 250 

GAGTGGGAAG CCAAAATTAT TGAACTTGCT GAAGCACTGG ATAGCTACAT 300 

TCCACAGCCA GAGCGCGCGA TTGATAAACC ATTCTTGCTG CCAATCGAAG 350 

ACGTATTCTC TATCTCAGGC CGTGGAACAG TTGTTACCGG GCGTGTTGAG 400

CGCGGTATCG TCAGAGTGGG CGAAGAAGTT GAAATCGTGG GTATCATCGA 450 

CACCATTAAA ACCACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCCGGTGAG AACGTTGGTG TTCTGCTGCG CGGTACTAAA 550 

CGCGATGACG TGCAACGTGG TCAAGTGTTG GCTAAACCAG GTTCTATTAA 600 

GCCGCATACC AAATTTGAGT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA TACTCCGTTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGACG GTGTTGAGAT 750 

GGTGATGCCA GGTGATAACA TTCAAATGAT TGTTAACCTG ATTGCGCCTA 800 

TTGCAATGGA TGACGGTCTA CGATTTGCA 829 



2) INFORMATION FOR SEQ ID NO: 241 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Yokenella regensburgei

(B) STRAIN: ATCC 35313

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241 

TGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCGGGCGACG 200

ACACTCCGAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAAGCA 250 

GAGTGGGAAG CTAAAATCGT TGAGCTGGCT GGCTTCCTGG ATTCTTACAT 300 

CCCAGAACCA GTTCGTGCTA TCGACCTGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGCACCG TTGTTACCGG TCGTGTAGAG 400

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450

GACTGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GTTCCATCAA 600 

GCCGCACACC AAATTCGAAT CTGAAGTTTA TATCCTGTCC AAAGACGAAG 650

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGC 804 




2) INFORMATION FOR SEQ ID NO: 242 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 849 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Achromobacter xylosoxidans subsp. denitrificans

(B) STRAIN: ATCC 15173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242 

TCAGTTCCCC CGCGATCACA TGCCCAAGAT CTACGAAGCG CTTACTCTGG 50

CCGACGAGGG TTCCTCGTTC GCCGAAAAGG GTCTGACGCT GGAAGTGCAG 100 

CAACAGCTGG GCGACGGCGT GGTGCGTACC ATCGCGCTGG GCTCCAGCGA 150 

CGGCCTGCGC CGCGGTATGA AGGTCACCGG TACGGGCGCG CCGATCTCGG 200 

TGCCGGTCGG CACCGGCACG CTGGGCCGCA TCATGGACGT GCTGGGTCGT 250 

CCCATCGACG AAGCCGGCCC GATCCAGCAC GAAGAAAAGC GTGGCATTCA 300

CCAGCCGGCT CCCCGTTTCG ACGAACTGTC GCCGTCGGTG GAACTGCTGG 350 

AAACCGGCAT CAAGGTTATT GACCTGGTCT GCCCGTTCGC CAAGGGCGGC 400

AAGGTCGGCC TGTTCGGCGG CGCCGGCGTG GGCAAGACCG TCAACATGAT 450

GGAACTGATC AACAACATCG CCAAGCAGCA CAGCGGCTTG TCGGTGTTCG 500

CCGGCGTGGG CGAGCGTACC CGCGAAGGCA ACGACTTCTA CCACGAAATG 550

GAAGAGTCGA ACGTTCTGGA CAAGGTTGCG ATGGTGTTCG GTCAGATGAA 600 

CGAACCCCCG GGCAACCGTC TGCGCGTGGC GCTGACCGGC CTGACCATGG 650 

CCGAGAAGTT CCGCGACGAA GGCCGCGACA TCCTGTTCTT CGTGGACAAC 700 

ATCTACCGCT ACACCCTGGC CGGTACGGAA GTGTCCGCGC TGCTGGGCCG 750 

TATGCCGTCG GCAGTGGGCT ACCAGCCCAC GCTGGCCGAA GAAATGGGCA 800

AGCTGCAAGA GCGCATCACC TCGACCAAGA CCGGCTCGAT CACCTCGAT 849



2) INFORMATION FOR SEQ ID NO: 243 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: ATCC 19606 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243 

TGAAACTACT TTAGAAGTTC AGCAACAACT TGGTGATGGT GTTGTTCGTA 50 

CCATCGCAAT GGGTTCTACA GAAGGTCTTA AACGTGGTCT TACTGTAACT 100 

AGCACAAACG CACCGATCTC TGTTCCAGTT GGTACAGCCA CTCTTGGCCG 150 

TATCATGGAC GTTTTAGGTC GTCCTATCGA TGAAGCAGGT CCTGTTGCGA 200

CTGAAGAACG TTTGCCGATT CACCGTCAAG CGCCTTCTTA TGCTGAACAA 250

GCAGCTTCTA CTGACCTTTT AGAAACTGGT ATTAAAGTCA TCGACTTACT 300

TTGCCCGTTT GCGAAAGGTG GTAAAGTTGG TTTATTCGGT GGTGCTGGTG 350

TTGGTAAAAC CGTTAACATG ATGGAATTGA TCAACAACAT CGCGAAAGCA 400

CACTCAGGTT TATCTGTGTT TGCTGGTGTT GGTGAGCGTA CTCGTGAAGG 450

TAATGACTTC TATCACGAAA TGAAAGATTC TAACGTTCTT GACAAAGTAG 500 

CAATGGTCTA CGGTCAGATG AACGAGCCAC CAGGTAACCG TTTACGCGTA 550 

GCGTTAACTG GTTTGACTAT GGCTGAATAC TTCCGTGATG AAAAAGACGA 600 

AAACGGTAAA GGTCGTGACG TATTATTATT CGTCGACAAC ATCTACCGTT 650 

ATACACTTGC AGGTACTGAA GTATCAGCAT TGTTAGGTCG TATGCCATCT 700 

GCGGTAGGTT ACCAACCTAC ACTTGCAGAA GAAATGGGTG TTCTTCAAGA 750 

GCGTATTACA TCTACTAAAT CTGGTTCGAT CACTTCG 787 



2) INFORMATION FOR SEQ ID NO: 244 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter lwoffii

(B) STRAIN: CDC F3697

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244 

GTTCCTAAGA TCTATGACGC TCTCCACGTT GATGGCACTG AAACTACATT 50 

AGAAGTTCAG CAACAACTTG GTGATGGCGT AGTTCGTACT ATTGCAATGG 100 

GTTCTACTGA AGGCCTTAAG CGTGGTTTGA ACGTAACTAA CACTAACGCG 150 

CCGATTTCTG TACCAGTAGG TACAGCGACT CTAGGTCGTA TCATGGACGT 200

TCTTGGTCGC CCAATCGACG AAGCTGGTCC AGTTGCGACT GAAGCGCGTT 250

TGCCGATTCA CCGTCAAGCA CCTTCTTATG CTGAACAAGC AGCTTCTACT 300

GACCTTTTAG AAACTGGTAT TAAAGTCATC GACTTACTTT GCCCGTTCGC 350

TAAAGGTGGT AAAGTTGGTC TGTTCGGTGG TGCCGGTGTT GGTAAAACTG 400

TAAACATGAT GGAGTTGATC AACAACATCG CTAAAGCGCA CTCAGGTTTA 450

TCTGTATTCG CTGGTGTTGG TGAGCGTACT CGTGAAGGTA ATGACTTCTA 500

TCACGAGATG AAAGACTCAA ACGTTCTAGA CAAAGTAGCA ATGGTCTACG 550

GTCAGATGAA CGAGCCACCG GGTAACCGTT TACGCGTAGC GTTGACTGGT 600

TTGACCATGG CTGAGTACTT CCGTGACGAG AAAGACGAAA ACGGCAAAGG 650

CCGTGACGTA CTATTGTTCG TAGATAACAT CTACCGTTAT ACACTAGCAG 700

GTACTGAAGT ATCAGCACTT CTAGGTCGTA TGCCGTCTGC AGTAGGTTAC 750

CAACCGACAC TTGCAGAAGA GATGGGTGTT CTTCAAGAAC GTATTACATC 800

GACTAAGTCT GGTTCGATTA CGTCA 825



2) INFORMATION FOR SEQ ID NO: 245

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 837 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSG-197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245 

CAATGAAGTT CCAGAAATTA ACAATGCCTT AGTCGTAGAC GTTGAAAGAG 50 

ATGAAGGTAC AGTATCTCTT ACATTAGAAG TGGCATTACA ACTTGGCGAT 100 

GATGTCGTAC GTACAATTGC AATGGATTCT ACTGATGGTG TTAAACGTGG 150 

TACAGAAGTT CGAGATAGCG GAGATAGCAT CAGTGTTCCA GTTGGTGATG 200 

CTACGTTAGG ACGTGTGTTT AATGTTCTTG GTGATACAAT TGACTTAGAC 250 

GAGAAGCTTG ATACTTCTGT CAAACGTGAT CCAATTCATA GAGAAGCACC 300

TGCATTCGAT CAATTATCAA CAAAAGTTGA AATCTTAGAA ACAGGTATTA 350

AAGTAATTGA TTTACTTGCA CCATATATTA AAGGTGGTAA AATCGGTTTA 400

TTCGGTGGCG CTGGTGTAGG TAAAACAGTA TTAATTCAAG AATTAATTAA 450

TAATATAGCT CAAGAACATG GTGGTATTTC AGTATTTGCC GGCGTAGGTG 500 

AACGTACGCG TGAAGGTAAT GACTTATACT ACGAAATGAG TGATAGTGGT 550 

GTTATTAAGA AAACAGCTAT GGTCTTCGGA CAAATGAATG AGCCACCTGG 600 

TGCGCGTATG CGTGTTGCTT TATCAGGCTT AACAATGGCT GAACACTTCC 650 

GTGATGTACA AGGACAAGAT GTTTTACTAT TTATTGATAA CATATTCAGA 700 

TTTACGCAAG CTGGTTCAGA AGTATCAGCA CTATTAGGTC GTATGCCATC 750 

AGCCGTTGGT TATCAACCTA CCCTTGCTAC TGAAATGGGT CAATTACAAG 800 

AACGTATTAC ATCAACAACT AAAGGATCTG TAACGTC 837 



2) INFORMATION FOR SEQ ID NO: 246 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alcaligenes faecalis 

(B) STRAIN: ATCC 15554 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246 



TTCCCCCGCG ACAGCATCCC TAAAGTCTAC GAAGCATTGA CGCTCGTTGA 50

CGAAAGTTCG GCTTTCGCAG AAAAAGGCCT GACTTTTGAA GTACAGCAAC 100

AATTGGGTGA CGGTGTAGTT CGCACCATCG CCATGGGTTC CAGCGACGGC 150

CTGCGCCGCG GTATGGAAGT GGCCGGTTCG GGCGCTCCCA TCTCCGTTCC 200

CGTGGGTGTC GGCACCCTGG GTCGCATTAT GGACGTTCTG GGTCGCCCTA 250

TTGACGAAGT CGGTCCTATT CAGTCCGACG AGCGTCGCGC CATTCACCAG 300

CCTGCGCCTA CTTTCGACGA ACTGTCGCCT TCCGTAGAGC TGCTGGAAAC 350

CGGTATTAAA GTGATTGACC TGGTTTGCCC GTTCGCCAAG GGTGGTAAGG 400

TTGGTCTGTT CGGTGGTGCC GGTGTGGGCA AGACCGTGAA CATGCTGGAG 450

CTGATCAACA ACATCGCCAA GGCACACAGC GGTCTGTCCG TGTTTGCCGG 500

TGTGGGTGAG CGTACCCGTG AAGGTAACGA CTTCTACCAC GAAATGGCCG 550

ATGCTGGCGT TATCCAGATG GACAACCTGA GCGAGTCCAA AGTGGCCATG 600

GTGTTCGGTC AGATGAACGA ACCTCCAGGC AACCGTCTGC GTGTGGCACT 650

GTCCGGCCTG ACCATGGCCG AGAAGTTCCG TGACGAAGGC CGTGACATCC 700

TGTTCTTTGT GGACAACATC TACCGCTACA CGCTGGCCGG TACAGAAGTG 750

TCCGCTCTGC TGGGTCGTAT GCCTTCCGCA GTGGGTTACC AGCCTACGCT 800

GGCCGAGGAA ATGGGTAAGC TGCAAGAGCG CATTACCTCC ACCAAGACCG 850

G 851



2) INFORMATION FOR SEQ ID NO: 247 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 846 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: 4229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247 

GATGGCGGAA AGCTACCAGA AATCTACAAC GCCCTTACGG TAAAACAGAG 50 

CAACGAAAAC GGAACAAGCA TTAACTTAAC ATTTGAAGTT GCACTTCATT 100 

TAGGTGATGA CACAGTTCGT ACAGTTGCAA TGTCTTCCAC AGATGGACTT 150 

GTTCGTGGCA CAGAAGTAGA AGATACTGGT AAAGCAATCT CTGTACCAGT 200 

TGGTGATGCA ACACTTGGTC GTGTATTTAA CGTATTAGGT GATGCAATTG 250 

ACTTAGATGG TGAGGTTCCT GCGGATGTAC GTCGTGATCC AATTCACCGT 300 

CAAGCACCTG CATTCGAAGA ATTATCTACT AAAGTAGAAA TTCTTGAAAC 350 

TGGTATTAAA GTAGTAGACT TACTTGCTCC TTACATTAAG GGTGGTAAGA 400 

TCGGTCTATT CGGTGGTGCC GGTGTAGGTA AAACGGTATT AATTCAGGAA 450

TTAATCAATA ACATCGCACA AGAACACGGT GGTATCTCTG TATTCGCTGG 500 

TGTAGGTGAG CGTACTCGTG AGGGTAATGA CTTATACCAC GAAATGAGCG 550 

ATTCTGGCGT AATTAAGAAA ACTGCGATGG TATTCGGACA AATGAACGAG 600 

CCACCTGGAG CACGTCAACG TGTTGCGTTA ACAGGTTTAA CAATGGCTGA 650 

GCATTTCCGT GATGAGCAAG GACAAGATGT ACTTCTGTTC ATCGATAATA 700 

TCTTCCGTTT CACGCAAGCA GGTTCTGAAG TATCTGCCCT TCTTGGCCGT 750 

ATGCCATCTG CGGTAGGTTA CCAACCAACA CTTGCAACAG AAATGGGTCA 800

ATTACAAGAG CGTATTACAT CTACAAATAA AGGGTCTATC ACGTCT 846



2) INFORMATION FOR SEQ ID NO: 248

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248 



CCAGAAATCT ACAATGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAAG 50 

CATGAACTTA ACATTTGAAG TTGCACTTCA TTTAGGTGAT GATACAGTTC 100 

GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAAGTA 150 

GAAGATACTG GTAAAGCAAT CTCTGTACCA GTTGGTGATG CAACACTTGG 200 

ACGTGTATTC AACGTATTAG GTGATGCAAT TGACTTAGAT GGTGAACTTC 250

CTGCGGATGT ACACCGTGAT CCAATTCACC GTCAAGCACC TGCATTCGAA 300 

GAATTATCTA CTAAAGTAGA AATTCTTGAA ACTGGTATTA AAGTAGTAGA 350 

CTTACTTGCT CCTTACATTA AGGGTGGTAA GATCGGCCTA TTCGGTGGTG 400

CCGGCGTAGG TAAAACAGTA TTAATTCAGG AGTTAATCAA TAACATCGCA 450

CAAGAGCACG GTGGTATCTC TGTATTCGCT GGTGTAGGTG AGCGTACTCG 500

TGAGGGTAAT GACTTATACC ACGAAATGAG CGATTCTGGC GTAATCAAGA 550 

AAACTGCGAT GGTATTCGGA CAAATGAACG AGCCACCTGG AGCACGTCAA 600 

CGTGTTGCAT TAACAGGTTT AACAATGGCT GAGCATTTCC GTGATGAGCA 650

AGGACAAGAC GTACTTCTGT TCATCGATAA CATCTTCCGT TTCACGCAAG 700

CGGGTTCTGA AGTATCTGCC CTTCTTGGTC GTATGCCATC TGCGGTAGGT 750

TACCAACCAA CACTTGCAAC AGAAATGGGT CAATTACAAG AGCGTATTAC 800 

ATCTACAAAT 810 






2) INFORMATION FOR SEQ ID NO: 249 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 944 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonis 

(B) STRAIN: ATCC 8503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249

CTTTGATTAT GGGACAGAAG AAACAGTAAC CCTCCCCCGT ATCCACGACG 50 

CCATGGAGAT TTCCCGCCCA AACGGAAAGA TCTTGATCGT CGAAGTTCAG 100 

CAACACATCG GGGAAAACAC CGTCCGTACC GTAGCGATGG ATACGACCGA 150 

CGGATTGAGA CGAGGCATGG AGGCCGTGTC ATACGGAATG CCCATCACCA 200

TGCCGACCGG CGACCAAGTC AAAGGACGTT TAATGAATGT CACCGGCGAC 250

CCTATCGATG GCATGGCCCA GCTTACTAAA GACGGGGCTC TTCCCATCCA 300

TCGTGAGCCT CCTAAATTCG AGGATCTGAC AACGACCCAA GAGGTCTTGT 350

ACACGGGTAT TAAAGTAATC GATTTATTGG AACCTTACGC CAAAGGAGGC 400

AAGATCGGAC TTTTCGGAGG AGCCGGAGTC GGCAAAACGG TATTGATCAT 450

GGAATTGATC AACAACATCG CAAAGAAAAA CAACGGATTC TCCGTCTTCG 500

CCGGTGTGGG TGAACGTACA CGTGAAGGAA ATGACCTATT GCGTGAAATG 550

ATCCAATCCG GTGTCATCCG ATACGGCGAG GAGTTTAAGA AGAGTATGGA 600

GGCAGGCAAC TGGGACTTGT CGAAGATCGA TTACGATGAA TTGGCTAAGT 650

CGCAGGCTAC CTTGGTATTC GGGCAGATGA ACGAACCGCC GGGCGCCCGC 700

TCGTCCGTGG CCTTATCCGG TTTGACGATC GCCGAATCCT TCCGTGACAA 750

AGCCTCTGAG GGAGAAAGAA AAGATATATT ATTCTTCATC GATAATATCT 800

TCCGTTTCAC CCAAGCCGGT TCCGAGGTTT CCGCCTTGCT GGGGCGTATG 850

CCTTCCGCCG TAGGTTACCA ACCGACATTG GCCACGGAAA TGGGAGCTAT 900

GCAAGAGCGT ATCACTTCAA CCAAGAAGGG CTCCATCACC TCCG 944



2) INFORMATION FOR SEQ ID NO: 250

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Bacteroides ovatus

(B) STRAIN: ATCC 8483 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250 

TTGAGGGTAC GGATGCAGAA TTGGTGCTGC CAAGCATCCA CGACGCACTG 50

GAGATAAAGA GGCCAAACGG CAAAATACTG GTTGTAGAAG TTCAGCAACA 100 

TATCGGCGAA AATACGGTGC GTACCGTAGC GATGGACAGT ACTGACGGAC 150 

TTCAGAGAGG CATGAAAGTG TATCCCACCG GAGGCCCGAT CACGATGCCG 200 

ATTGGCGAAC AGATTAAAGG ACGACTGATG AACGTAGTCG GTGATTCGAT 250 

CGACGGTATG AAAGGACTCG ACCGCAAAGG TGCATATTCC ATTCATCGCG 300

ACCCCCCTAA GTTTGAGGAT TTGACTACTG TGCAAGAGGT GCTCTTCACA 350 

GGTATCAAAG TGATCGACCT GCTCGAACCG TATGCCAAAG GTGGTAAAAT 400 

CGGTTTGTTC GGCGGTGCCG GTGTAGGAAA GACTGTATTG ATTCAGGAAC 450 

TTATCAATAA TATCGCCAAG AAACATAATG GATTCTCTGT ATTTGCCGGA 500 

GTAGGTGAAC GTACCCGTGA AGGTAACGAC TTGCTGCGCG AAATGATTGA 550

ATCCGGTGTA ATCCGTTACG GCGAAGCATT CAAAGAAGGA ATGGAGAAAG 600 

GTCACTGGGA TCTTTCGAAA GTGGATTATA ACGAACTGGA GAAATCGCAA 650 

GTGTCTCTGA TTTTCGGTCA GATGAACGAG CCTCCGGGCG CACGTGCCTC 700 

TGTGGCATTG TCCGGACTGA CGGTGGCGGA ATCTTTCCGC GACGCAGGAA 750 

AAGAAGGTGA GAAACGCGAT ATTCTGTTCT TTATTGATAA TATCTTCCGT 800

TTCACGCAAG CAGGTTCAGA AGTGTCCGCC CTTTTGGGAC GTATGCCCTC 850 

CGCTGTTGGT TACCAGCCCA CGTTGGCTAC GGAAATGGGT GCGATGCAGG 900 

AACGTATCAC GTCTACCCGC AAAGGTTCTA TCACCTCCG 939 



2) INFORMATION FOR SEQ ID NO: 251 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 833 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leclercia adecarboxylata 

(B) STRAIN: ATCC 23216 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251 



CGAATTCCCT CAGGATGCCG TACCGCGCGT GTACGATGCT CTTGAGGTTC 50 

AGAATGGTAA TGAGAGCCTG GTGCTGGAAG TTCAGCAGCA GCTCGGCGGC 100 

GGTATTGTGC GTACCATCGC CATGGGTTCT TCCGACGGTC TGCGTCGTGG 150

TCTGGAAGTT AAAGACCTCG AGCACCCAAT CGAAGTACCA GTAGGTAAAG 200 

CAACCCTGGG CCGTATCATG AACGTCCTGG GTCAGCCGAT CGACATGAAA 250 

GGCGACATCG GCGAAGAAGA GCGTTGGGCT ATCCACCGTT CAGCACCTTC 300

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 350

TTATCGACCT GATGTGTCCG TTCGCGAAGG GCGGTAAAGT TGGTCTGTTC 400

GGCGGTGCGG GTGTAGGTAA AACCGTAAAC ATGATGGAGC TGATCCGTAA 450

CATCGCGATC GAGCACTCCG GTTACTCCGT GTTTGCAGGC GTGGGTGAGC 500 

GTACTCGTGA GGGTAACGAC TTCTACCACG AAATGACCGA CTCCAACGTT 550 

CTGGACAAAG TATCCCTGGT TTACGGCCAG ATGAACGAGC CACCAGGAAA 600 

CCGTCTGCGC GTTGCGCTGA CCGGCCTGAC CATGGCTGAG AAGTTCCGTG 650

ACGAAGGTCG TGACGTACTG CTGTTCGTTG ACAACATCTA CCGTTACACC 700 

CTGGCCGGTA CGGAAGTATC CGCACTGCTG GGTCGTATGC CATCAGCAGT 750 

AGGCTACCAG CCGACCCTGG CGGAAGAGAT GGGTGTTCTG CAGGAACGTA 800 

TCACCTCTAC CAAAACCGGT TCTATCACCT CCG 833 




2) INFORMATION FOR SEQ ID NO: 252 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stenotrophomonas maltophilia 
(B) STRAIN: CDC F3338

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252 

GTGTACGACG CACTGAAGGT GGAAAACACC GAGATCACCC TCGAAGTCCA 50 

GCAGCAGCTG GGCGACGGCG TGGTGCGTAC CATCGCCCTC GGTTCCACCG 100

ACGGCCTGAA GCGCAACCTG GTTGCCGTCA ACACCGGCCG TGGCATCTCG 150

GTGCCGGTCG GCGCCGGCAC CCTGGGCCGC ATCATGGACG TGCTGGGCCG 200

TCCGATCGAC GAAGCCGGCC CGGTCGCTGC CAGCGACAAC TGGGAAATCC 250 

ACCGCGCTGC GCCGTCGTAT GAAGACCAGT CCCCGGCCAC CGAGCTGCTG 300 

GAAACCGGCA TCAAGGTCAT CGACCTGATG TGCCCGTTCG CCAAGGGCGG 350

CAAGGTCGGC CTGTTCGGCG GCGCCGGCGT CGGCAAGACC GTCAACATGA 400

TGGAACTGAT CAACAACATC GCCAAGGCGC ACAGCGGCCT GTCCGTGTTC 450

GCCGGCGTGG GTGAGCGTAC CCGTGAGGGC AACGACTTCT ACCACGAGAT 500

GAAGGACTCC AACGTCCTGG ACAAGGTGGC GATGGTGTAC GGCCAGATGA 550

ACGAGCCGCC GGGCAACCGT CTGCGCGTCG CCCTGACCGG CCTGACCATG 600

GCCGAGTACT TCCGCGATGA GAAGGACGAA AACGGCAAGG GCAAGGACGT 650

CCTGCTGTTC GTCGACAACA TCTACCGCTA CACCCTGGCC GGTACCGAAG 700

TGTCGGCACT GCTGGGCCGC ATGCCGTCCG CGGTGGGTTA CCAGCCGACC 750

CTGGCCGAGG AAATGGGCGT CCTGCAGGAG CGCATCACCT CGACCAAGAA 800

TGGTTCGATC ACCTCGATC 819



2) INFORMATION FOR SEQ ID NO: 253 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 864 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bartonella henselae 

(B) STRAIN: ATCC 49882 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253 

AGTTTGAAGG CCCTTTGCCA AATATTCTCA ATGCATTAGA AACAGATAAT 50 

TTGGGCAATC GGCTAGTTTT AGAAGTTGCT CAGCATTTGG GTGAAAATAC 100 

CGTGCGTACC ATTGCCATGG ATACTACCGA TGGTCTTGTC CGTGGTCAAA 150 

AAGTTTTTGA TACAGGAACA CAGATCAGTG TTCCCGTGGG AGAAGCAACA 200 

CTTGGTCGTA TTATGAATGT GATTGGAGAG CCGGTTGATA ATGTTGGCCC 250 

AATTGCTACA AGCAAAACCC GTTCCATTCA CCAAGAGGCT CCTGAATATG 300 

TGGAGCAATC AACCGCATCA GAAATCCTTG TGACTGGTAT TAAAGTCGTT 350

GATCTGTTAG CTCCTTATTC TAAAGGGGGG AAGGTTGGTT TGTTTGGAGG 400 

TGCCGGTGTT GGTAAAACCG TTCTCATTAT GGAGCTTATC AACAATATTG 450 

CAAAGGCGCA TGGTGGCTAT TCAGTGTTTG CCGGTGTTGG TGAACGTACA 500 

CGTGAGGGAA ATGATCTTTA TTATGAAATG ATCGAAAGCC GTGTGAATGT 550 

GAATCCAAAA GACAACAATG GTTCAACAGA AGGATCAAAA TGTGCACTCG 600 

TTTATGGGCA AATGAATGAA CCACCAGGGG CGCGTGCACG TGTGGCTCTT 650 

TCAGGATTGA CCATTGCAGA AAGTTTCCGT GATGAGGGAC AAGATGTTTT 700 

GTTCTTCGTA GATAATATTT TCCGTTTTAC GCAAGCAGGC GCTGAAGTGT 750 

CAGCTCTTTT AGGGCGTATT CCTTCTGCTG TAGGGTATCA GCCAACTTTG 800 

GCAACTGATA TGGGGGCTTT GCAAGAGCGT ATTACCAGTA CAAGAACAGG 850 

CTCTATTACC TCTG 864 



2) INFORMATION FOR SEQ ID NO: 254 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 866 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium adolescentis 

(B) STRAIN: ATCC 15703 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254

TTCCCGGTGG GCCACCTGCC CGACATTTAC AATGCACTTA CCGTTGAACT 50

GKCCAACACC GGYGTCCACG AASAGGGCGA GACCACCAAG AAGATCACCC 100

TTGMGGTTGA ACAGCATCTT GGCGATTCCA CCGTGCGTAC CGTCGCTCTG 150

AAGCCGACTG ACGGCCTTGT GCGTGGCGCC ACCGTGTATG ACACCGGCGG 200

CCCGATCTCT GTGCCGGTTG GCGATGTCAC CAAGGGCCAC GTATTCGACG 250

TGTCCGGCAA CATCCTCAAC AAGAAGGCCG ACGAGACCGT TAAGGTTACC 300

GAACGTTGGT CTATCCACCG TAACCCGCCG GCATTCGACC AGCTGGAGTC 350

CAAGACCCAG ATGTTCGAAA CCGGTATCAA GGTCATCGAT TTGCTGACCC 400

CGTATGTGCA GGGCGGCAAG ATCGGTCTGT TCGGCGGCGC AGGCGTCGGC 450

AAGACCGTGC TGATCCAGGA AATGATTCAG CGTGTGGCTC AGAACCACGG 500

CGGTGTGTCC GTGTTCGCAG GCGTCGGCGA GCGTACCCGT GAGGGTAACG 550

ATCTGATCGG CGAAATGGAC GAAGCCGGCG TGCTCGAGAA GACCGCACTG 600

GTCTTCGGCC AGATGGATGA GCAGCCGGGT ACCCGTCTGC GCGTGCCGCT 650

GACCGCACTG ACCATGGCAG AGTACTTCCG TGACGTACAG AATCAGGACG 700

TGCTGCTGTT CATCGATAAC ATCTTCCGTT TCACCCAGGC TGGTTCCGAG 750

GTGTCCACCC TGCTCGGCCG TATGCCGTCC GCAGTGGGCT ACCAGCCGAA 800

CCTGGCCGAT GAGATGGGCG CGCTGCAGGA GCGAATCACT TCGACCCGTG 850

GACACTCCAT CACCTC 866



2) INFORMATION FOR SEQ ID NO: 255 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brucella abortus 

(B) STRAIN: S2308 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 



AAGGCCAGCT GCCGCTGATC CTGAACGCGC TTGAAGTGGA CAATCAGGGC 50

CATCGTCTGG TTCTCGAAGT TGCCCAGCAC CTCGGCGAAG ACACCGTGCG 100

CACCATCGCC ATGGACGCGA CCGAAGGTCT CGTTCGCGGT CAGGAAGCAC 150

GCGACACTGG CGAACCGATC ATGGTGCCGG TCGGCGTCGA AACGCTTGGC 200

CGCATCATGA ACGTCATCGG CGAGCCGGTT GACGAAGCAG GCCCCATCAA 250

GACCAAGGCA ACCCGCGCCA TCCACCAGAA CGCGCCGGAA TATATCGAAC 300

AGTCGACCGA AGCCGAAATT CTGGTCACGG GCATCAAGGT CGTCGACCTT 350

CTGGCGCCTT ACGCCAAGGG CGGCAAGATC GGCCTCTTCG GCGGTGCAGG 400

CGTCGGCAAG ACCGTTCTCA TCATGGAACT CATCAACAAC GTCGCCAAGG 450

CGCACGGCGG TTATTCCGTG TTCGCAGGCG TCGGTGAGCG TACCCGTGAG 500

GGCAACGACC TTTACCACGA AATGATCGAG TCGGGCGTGA ACAAGCTCGG 550

CGGCGGCGAA GGCTCCAAGG CAGCCCTCGT TTACGGCCAG ATGAACGAAC 600

CCCCGGGTGC CCGCGCCCGC GTTGCCCTTT CCGGTCTGAC GGTTGCTGAA 650

AACTTCCGTG ACCAGGGCCA GGACGTTCTG TTCTTCGTGG ACAACATCTT 700

CCGCTTCACG CAGGCAGGTT CGGAAGTGTC GGCTCTTCTC GGCCGTATTC 750

CTTCCGCTGT GGGTTATCAG CCGACGCTGG CAACCGACAT GGGCGCCATG 800

CAGGAACGCA TCACCACGAC GACCAAGGGT TCGATCACCT CG 842



2) INFORMATION FOR SEQ ID NO: 256

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Cedecea davisae

(B) STRAIN: ATCC 33431

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256

CGAGTTCCCT CAGGACGGCG TACCGCGCGT TKATGATGCG CTTGAAGTAC 50

AAAATAACAG CGAGCAGCTG GTGCTGGAAG TTCAGCAGCA GCTCGGCGGC 100 

GGTATCGTGC GTACCATCGC TATGGGTTCT TCCGACGGTC TGCGTCGTGG 150 

TCTGGAAGTT AAAGACCTCG AGCACCCGAT CGAAGTCCCG GTAGGTAAAG 200 

CGACCCTTGG CCGTATCATG AACGTGCTGG GTCAGCCGAT CGACATGAAA 250 

GGCGATATCG GCGAAGAAGA CCGTTGGGCT ATTCACCGCG CTGCACCTTC 300

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 350 

TAATCGACCT TATCTGTCCG TTCGCTAAGG GCGGTAAAGT AGGTCTGTTC 400

GGTGGTGCGG GCGTGGGTAA AACCGTAAAC ATGATGGAGC TTATCCGTAA 450 

CATCGCGATC GAGCACTCCG GCTACTCCGT GTTTGCGGGC GTGGGTGAGC 500 

GTACTCGTGA GGGTAACGAC TTCTATCACG AAATGACCGA CTCCAACGTT 550

CTGGACAAAG TTGCCCTGGT TTACGGCCAG ATGAACGAGC CACCGGGTAA 600 

CCGTCTGCGC GTAGCGCTGA CCGGTCTGAC CATCGCGGAG AAATTCCGTG 650 

ACGAAGGTCG TGACGTTCTG CTGTTCGTTG ATAACATCTA CCGTTACACC 700 

CTGGCCGGTA CTGAAGTATC CGCGCTGCTG GGTCGTATGC CTTCTGCGGT 750 

AGGTTACCAG CCAACTCTGG CGGAAGAGAT GGGTGTTCTT CAGGAGCGTA 800

TTACCTCCAC CAAGACCGGT TCCATCACCT CCG 833 



2) INFORMATION FOR SEQ ID NO: 257

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Cedecea lapagei

(B) STRAIN: ATCC 33432

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257



TTCCCTCAGG ACGGCGTACC GCGCGTATAT GACGCGCTTG AGGTACAGAA 50 

TAACAGCGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTACGTAC CATCGCAATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GAAGTGAAAG ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC 200 

TCTGGGTCGT ATCATGAACG TGCTGGGTCA GCCAATTGAT ATGAAAGGCG 250

ACATCGGCGA AGAAGATCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAT 300 

GAAGAGCTGT CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 350 

CGACCTGATT TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG 400

GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCAGGCGTGG GTGAGCGTAC 500

TCGTGAGGGT AACGACTTCT ACCACGAGAT GACCGACTCC AACGTTCTGG 550

ACAAAGTTGC ACTGGTTTAC GGCCAGATGA ACGAGCCGCC AGGTAACCGT 600

CTGCGCGTAG CGCTGACCGG TCTGACCATC GCGGAGAAAT TCCGTGACGA 650

AGGCCGTGAC GTTCTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 700

CCGGTACAGA AGTTTCTGCA CTGCTGGGTC GTATGCCATC TGCGGTAGGT 750

TATCAGCCTA CTCTGGCAGA AGAGATGGGT GTTCTTCAGG AGCGTATTAC 800

CTCCACCAAG ACCGGTTCCA TCACTTCCG 829



2) INFORMATION FOR SEQ ID NO: 258

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea neteri 

(B) STRAIN: ATCC 33855 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258 

TCCCTCAGGA CGGCGTACCG CGCGTTTATG ACGCGCTTGA GGTACAGAAC 50 

AACAATGAGA AGCTGGTGCT GGAAGTTCAG CAGCAGCTCG GCGGCGGTAT 100

CGTGCGTACC ATCGCAATGG GTTCTTCCGA CGGTCTGCGT CGTGGTCTGG 150 

TAGTAACAGA CCTCGAGCAC CCGATCGAAG TCCCGGTAGG TAAAGCGACC 200 

CTTGGCCGTA TCATGAACGT GCTGGGTCAG CCGATCGACA TGAAAGGCGA 250 

CATCGGCGAA GAAGACCGTT GGGCGATTCA CCGCGCAGCA CCTTCCTACG 300 

AAGAGCTGTC CAGCTCTCAG GAATTGCTGG AAACCGGCAT CAAAGTTATC 350 

GACCTGATTT GTCCGTTCGC TAAGGGCGGT AAAGTAGGTC TGTTCGGTGG 400

TGCGGGCGTA GGTAAAACCG TAAACATGAT GGAGCTGATC CGTAACATTG 450

CGATCGAGCA CTCCGGTTAT TCCGTGTTTG CGGGCGTGGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG ACCGACTCCA ACGTTCTGGA 550 

TAAAGTAGCA CTGGTTTACG GCCAGATGAA CGAGCCACCA GGTAACCGTC 600 

TGCGCGTAGC GCTGACCGGT CTGACCATCG CGGAAAAATT CCGTGACGAA 650 

GGCCGCGACG TTTTGCTGTT CGTTGATAAC ATCTACCGTT ATACCCTGGC 700 

CGGTACCGAA GTATCTGCAC TGCTGGGTCG TATGCCTTCT GCGGTAGGTT 750 

ATCAGCCAAC TCTGGCAGAA GAGATGGGTG TTCTTCAGGA GCGTATTACC 800 

TCCACCAAGA CCGGTTCTAT CACCTCCGTA 830



2) INFORMATION FOR SEQ ID NO: 259 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chryseobacterium meningosepticum

(B) STRAIN: CDC B7681

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259

TTTCAGGAAG TAGAGGAACT ACCAAACATT TATGACGCAC TTGAAGTTGT 50 

CAGAGAAGGC CAGAAAGGTC TGATCTTAGA AGTTGAACAA CACATCGGTG 100 

AGGATACAGT GAGATGTATC GCTATGGATG CAACAGACGG ACTTCAAAGA 150

GGACAATCTG TAATAGGACA TGGAAGACAA ATTACTATGC CTATTGGTGA 200

AGAAGTAAAT GGTAGATTGT TCAACGTGGT AGGAGATGCT ATCGACGGAC 250 

TTCAGGAATT ATCTAAAGAC AATGGTTTGC CAATCCACAG AGAAGCACCT 300 

AAATTCGATC AGTTATCAAC TTCTGCTGAA GTACTATATA CAGGTATCAA 350 

AGTAATCGAC CTTATCGAGC CTTATGCAAA AGGTGGTAAA ATTGGTTTGT 400

TCGGTGGTGC TGGTGTAGGT AAAACAGTAT TGATCCAGGA ATTGATTAAT 450 

AACATTGCTA AAGGACACGG TGGTCTTTCT GTATTCGCAG GAGTAGGTGA 500 

GAGAACAAGA GAAGGAAATG ACCTTCTTCG TGAGATGTTA GAGTCTGGTA 550 

TTATTAAATA TGGTGACGAA TTCATGCATT CTATGGAGAA CGGTGGATGG 600 

GATCTTTCTA AAGTTGACAG TGAGTTGATG AAAGAGTCTA AAGCTGCTTT 650

CGTTTTCGGA CAGATGAACG AGCCACCAGG TGCAAGAGCA CGTGTAGCCC 700 

TTTCTGGTCT TACTTTAGCT GAATACTACC GTGATGGTGG CGAAAGCGGA 750 

CAAGGTAGAG ACGTTCTTTT CTTCGTAGAC AACATCTTCC GTTTTACACA 800 

GGCTGGTTCT GAGGTGTCTG CACTTCTAGG TCGTATGCCT TCAGCGGTAG 850 

GTTACCAACC AACTCTAGCT TCTGAGATGG GTGCAATGCA GGAGAGAATT 900

ACTTCAACTA AAAACGGATC TATTACATCT G 931 



2) INFORMATION FOR SEQ ID NO: 260

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 726 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Citrobacter amalonaticus

(B) STRAIN: ATCC 25405

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260



GTATCGTACG TACCATCGCA ATGGGTTCTT CCGACGGTCT GCGTCGTGGT 50 

CTGGAAGTCA AAGACCTCGA GCACCCGATC GAAGTCCCGG TAGGTAAAGC 100 

AACCCTGGGT CGTATCATGA ACGTCCTGGG TCACCCGATC GACATGAAAG 150 

GCGATATCGG TGAAGAAGAG CGTTGGGCTA TCCACCGCGC AGCACCGTCC 200 

TATGAAGAGC TGTCCAGCTC TCAGGAACTG CTGGAAACCG GTATCAAAGT 250

TATCGACCTG ATGTGTCCGT TCGCGAASGG CGGTAAAGTG GGTCTGTTCG 300 

GTGGTGCGGG TGTAGGTAAA ACCGTAAACA TGATGGAGCT TATCCGTAAC 350 

ATCGCGATCG AGCACTCCGG TTACTCCGTG TTTGCGGGCG TAGGTGAACG 400 

TACTCGTGAG GGTAACGACT TCTACCACGA AATGACCGAC TCCAACGTTC 450

TGGATAAAGT ATCCCTGGTA TATGGCCAGA TGAACGAGCC GCCGGGAAAC 500

CGTCTGCGCG TTGCACTGAC CGGTCTGACC ATGGCAGAGA AATTCCGTGA 550 

CGAAGGTCGT GACGTACTGC TGTTCGTCGA TAACATCTAT CGTTACACCC 600 

TGGCCGGTAC GGAAGTTTCC GCACTGCTGG GTCGTATGCC ATCAGCGGTA 650 

GGTTACCAGC CGACCCTGGC GGAAGAGATG GGTGTTCTGC AGGAACGTAT 700 

CACTTCTACC AAAACCGGTT CTATCA 726



2) INFORMATION FOR SEQ ID NO: 261 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 812 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Citrobacter braakii

(B) STRAIN: ATCC 43162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261 



GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAT 150

CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACTC TGGGTCGTAT 200

CATGAACGTC CTGGGTCACC CGATCGACAT GAAAGGCGAT ATCGGTGAAG 250

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTATGA AGAGCTGTCC 300

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAACAC 450

TCCGGTTACT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAT AAAGTATCCC 550

TGGTATATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCT 600

CTGACCGGTC TGACCATGGC AGAGAAGTTC CGTGACGAAG GTCGTGACGT 650

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC GGTACAGAAG 700

TATCCGCTCT GCTGGGTCGT ATGCCATCAG CGGTAGGCTA CCAGCCGACC 750

CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800

CGGTTCTATC AC 812



2) INFORMATION FOR SEQ ID NO: 262

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter koseri 

(B) STRAIN: ATCC 27156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262 



GCGCGTGTAC GACGCCCTTG AGGTGCAGAA TGGTAATGAA CATCTGGTGC 50

TGGAAGTTCA GCAGCAGCTC GGTGGCGGTA TCGTACGTAC CATCGCCATG 100

GGTTCTTCCG ACGGCCTGCG TCGTGGTCTG GATGTGAAAG ACCTTGAGCA 150

CCCGATCGAA GTCCCGGTAG GTAAAGCAAC GCTGGGTCGT ATCATGAACG 200

TACTGGGCGA ACCAGTAGAC ATGAAAGGCG AGATCGGTGA AGAAGAGCGT 250

TGGGCTATCC ACCGTGCGGC ACCGTCCTAC GAAGAGTTGT CAAACTCTCA 300

GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG TGCCCGTTCG 350

CGAAGGGCGG TAAAGTGGGT CTGTTCGGTG GTGCGGGTGT AGGTAAAACC 400

GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAAC ACTCCGGTTA 450

CTCCGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT AACGACTTCT 500

ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC CCTGGTTTAC 550

GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG CGCTGACCGG 600

CCTGACCATG GCGGAGAAAT TCCGTGACGA AGGTCGTGAC GTTCTGCTGT 650

TCGTCGACAA CATCTACCGT TACACCCTGG CCGGTACGGA AGTATCCGCA 700

CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TACCAGCCGA CCCTGGCGGA 750

AGAGATGGGT GTTTTGCAGG AACGTATCAC CTCCACCAAA ACCGGTTCTA 800

TCACCTCCGT A 811



2) INFORMATION FOR SEQ ID NO: 263 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter farmeri 

(B) STRAIN: ATCC 51112 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263 

GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTTATGA ATGGTAAAGA 50 

GAGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGT ATCGTACGTA 100 

CCATCGCGAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GGAAGTAAAA 150 

GACCTCGAAC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CTCTGGGTCG 200 

TATCATGAAC GTCCTGGGTC ACCCGATCGA CATGAAAGGC GATATCGGTG 250 

AAGAAGAGCG TTGGGCTATC CACCGCGCAG CGCCATCCTA TGAAGAGCTG 300 

TCCAGCTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTT GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAACTCA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCCGTGTT TGCGGGCGTA GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGATTC CAACGTTCTG GATAAAGTAT 550 

CCCTGGTATA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCGCTGACCG GTCTGACCAT GGCAGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTACTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGC CGTATGCCAT CAGCGGTAGG CTACCAGCCA 750 

ACCCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CTTCTACCAA 800 

AACCGGTTCT ATTACC 816 



2) INFORMATION FOR SEQ ID NO: 264 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter freundii 

(B) STRAIN: ATCC 8090 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264



AGGATGCCGT ACCGCGCGTG TACGATGCTC TTGAGGTTAT GAATGGTAAA 50 

GAGAGCCTGG TGCTGGAAGT TCAGCAGCAG CTCGGCGGCG GTATCGTACG 100 

TACCATCGCC ATGGGTTCTT CTGACGGTCT GCGTCGTGGT CTGGAAGTTA 150

AAGACCTCGA GCACCCGATC GAAGTCCCGG TAGGTAAAGC AACGCTGGGT 200 

CGTATCATGA ACGTTCTGGG TCACCCGATC GACATGAAAG GCGATATCGG 250 

TGAAGAAGAG CGTTGGGCTA TCCACCGTGC AGCACCTTCC TACGAAGAGC 300 

TGTCAAGCTC TCAGGAACTG CTGGAAACCG GTATCAAAGT TATCGACCTG 350 

ATGTGTCCGT TCGCTAAGGG CGGTAAAGTT GGTCTGTTCG GTGGTGCGGG 400

TGTAGGTAAA ACCGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG 450

AACACTCCGG TTACTCCGTG TTTGCGGGCG TAGGTGAACG TACTCGTGAG 500 

GGTAACGACT TCTACCACGA AATGACCGAC TCCAACGTTC TGGACAAAGT 550 

ATCCCTGGTA TATGGCCAGA TGAACGAGCC GCCTGGAAAC CGTCTGCGTG 600 

TTGCGCTGAC CGGTCTGACC ATGGCTGAGA AGTTCCGTGA CGAAGGTCGT 650

GACGTTCTGC TGTTCGTTGA TAACATCTAT CGTTACACCC TGGCCGGTAC 700 

AGAAGTATCT GCACTGCTGG GTCGTATGCC ATCAGCGGTA GGCTACCAGC 750 

CGACCCTGGC GGAAGAGATG GGTGTTCTGC AGGAACGTAT CACCTCCACC 800 

AAAACCGGTT CTATCACCT 819 




2) INFORMATION FOR SEQ ID NO: 265 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter koseri 
(B) STRAIN: ATCC 27028

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265 

GATGCCGTAC CGCGCGTGTA CGACGCCCTT GAGGTGCAGA ATGGTAATGA 50 

ACATCTGGTG CTGGAAGTTC AGCAGCAGCT CGGTGGCGGT ATCGTACGTA 100

CCATCGCCAT GGGTTCTTCC GACGGCCTGC GTCGTGGTCT GGATGTGAAA 150 

GACCTTGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CGCTGGGTCG 200 

TATCATGAAC GTACTGGGCG AACCAGTAGA CATGAAAGGC GAGATCGGTG 250 

AAGAAGAGCG TTGGGCTATC CACCGTGCGG CACCGTCCTA CGAAGAGTTG 300 

TCAAACTCTC AGGAACTGTT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 350

GTGTCCGTTC GCGAAGGGCG GTAAAGTGGG TCTGTTCGGT GGTGCGGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAA 450

CACTCCGGTT ACTCCGTGTT TGCGGGCGTA GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GACAAAGTAT 550 

CCTTGGTTTA CGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600

GCGCTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTCGACA ACATCTACCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTACCAGCCG 750 

ACCCTGGCGG AAGAGATGGG TGTTTTGCAG GAACGTATCA CCTCCACCAA 800

AACCGGTTCT ATCACCTCCG TA 822



2) INFORMATION FOR SEQ ID NO: 266

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 820 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Citrobacter sedlakii

(B) STRAIN: ATCC 51115

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266

GATGCCGTAC CGCGCGTGTA CGACGCCCTT GAGGTACAGA ATGGTAATGA 50

GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGTGGCGGT ATTGTACGTA 100 

CCATCGCCAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GGAAGTAAAA 150 

GACCTTGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CGCTGGGTCG 200 

TATCATGAAC GTACTGGGCG AACCAGTAGA CATGAAAGGC GACATCGGTG 250 

AAGAAGAGCG TTGGGCTATC CACCGTGCCG CGCCGTCCTA TGAAGAGTTG 300

TCTAACTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGCG 400

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GACAAAGTAT 550

CCCTGGTGTA CGGCCAGATG AACGAGCCGC CTGGAAACCG TCTGCGCGTC 600 

GCACTGACCG GTCTGACCAT GGCTGAGAAG TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTATCAGCCG 750 

ACTCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CCTCAACCAA 800

AACCGGTTCT ATCACCTCCG 820



2) INFORMATION FOR SEQ ID NO: 267

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 806 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Citrobacter werkmanii

(B) STRAIN: ATCC 51114

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACCC TGGGTCGTAT 200 

CATGAACGTC CTGGGTCATC CGATCGACAT GAAAGGCGAT ATCGGTGAAG 250

AAGAGCGTTG GGCTATCCAC CGCGCAGCAC CTACCTATGA AGAACTGTCC 300

AGTTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350

TCCGTTCGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAACAC 450

TCCGGTTACT CAGTGTTTGC GGGCGTTGGT GAACGTACTC GTGAGGGTAA 500

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550

TGGTATATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600

CTGACCGGTC TGACCATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC GGTACTGAAG 700

TATCTGCACT GCTGGGTCGT ATGCCATCAG CGGTAGGCTA CCAGCCAACC 750

CTGGCGGAAG AGATGGGTGT TCTGCAGGAA CGTATCACCT CTACCAAAAC 800

CGGTTC 806



2) INFORMATION FOR SEQ ID NO: 268

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 810 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Citrobacter youngae

(B) STRAIN: ATCC 29935

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 200

CATGAACGTT CTGGGTCACC CGATCGACAT GAAAGGCGAT ATCGGAGAAG 250 

AAGAGCGTTG GGCTATTCAC CGCGCAGCAC CTTCCTATGA AGAGCTGTCC 300 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450

TCCGGTTACT CTGTGTTTGC GGGTGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGATTCCAA CGTTCTGGAT AAAGTATCCC 550 

TGGTTTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA TACCCTGGCC GGTACGGAAG 700

TATCCGCACT GCTGGGTCGT ATGCCATCAG CGGTAGGTTA CCAGCCGACC 750 

CTGGCGGAAG AGATGGGTGT TCTGCAGGAA CGTATCACTT CTACCAAAAC 800 

CGGTTCTATC 810 


2) INFORMATION FOR SEQ ID NO: 269 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 827 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Clostridium innocuum

(B) STRAIN: ATCC 14501

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269

TTGAGAACGG AGATTTGCCG CAGCTATTGA CCGCTATTGA AATTCCTCTG 50

AAAGACAGTG AATCTCTGAT TGTCGAAGTT GCTCAGCATA TCGGTGATGA 100 

ACGTGTCCGC TGTATCGCTA TGGGCGGTAC AGATGGTCTG GTTCGTGGAA 150

TGGAAGCCAT TGATACAGGA TCCGCAATCC GTGTACCGGT GGGAAAAGAA 200 

ATTCTGGGAA GAATGTTCAA TGTCCTCGGA CGTGAAATTG ATGGTCTGGG 250 

ACCTGTAGGA ACGGATAACA CACTGCCGAT CCACAGACAG GCACCGGGCT 300

TTGAGGAGCA GCAGACATCC GCAGAAATGC TGGAAACAGG AATTAAGGTC 350 

ATTGACCTGT TATGTCCATA TTCCAAGGGT GGTAAGATTG GTTTGTTTGG 400

TGGTGCGGGA GTAGGTAAAA CCGTACTGAT TCAGGAGCTG ATTCATAATA 450

TCGCCAAGGA ACATGGTGGA ATGTCCGTCG TTACCGGTGT AGGGGAGAGA 500 

ACCCGTGAAG GAAACGACAT GTATCATGAA ATGAAGGACA GCGGTGTCCT 550 

TGATAAGACC GTACTGGTTT ACGGACAGAT GAATGAATCA CCGGGTGCCA 600 

GAATGCGTGT CGGTCTGACC GGGCTGACGA TGGCGGAATA TTTCCGTGAT 650

CACGACCATC AGGATGTATT GCTGTTTATT GATAATATTT TCCGTTTTAC 700 

CCAGGCGGGA AGTGAAGTAA GTGCCCTGCT GGGACGTATG CCAAGTGCAG 750 

TAGGCTATCA GCCGACACTT GCGACAGAAA TGGGACAGCT GCAGGAGCGC 800 

ATTACATCCA CGAAGGATGG TTCCATT 827 




2) INFORMATION FOR SEQ ID NO: 270 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Clostridium perfringens

(B) STRAIN: ATCC 13124

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270 

GCAAATGATG AACTTCCTAA TATATTTAAT GCAATACACA TAAAAATGGA 50 

CGATGGAAAA ATCTTAGTTT GTGAGGTAGA GCAACACGTA GGAGACGATA 100

TAGTTAGAAC TATAGCTATG GAAGCTACTG AAGGACTAAG AAGAGGTGTA 150

GAAGCTGTTG ATACAGGAGC ACCTATATCA GTACCAGTTG GTGAATGCGT 200

ATTAGGAAGA ATATTTAACG TATTAGGTAA ACCACTAGAT AGTGGAGCTG 250

AAGTTAATAA CGAAGAAAAA TATCCAATTC ATAGACCAGC TCCATCATTT 300

GAAGAACAAT CAGTTGTTCC TCAAATGTTT GAGACAGGAA TAAAGGTTAT 350

CGACCTTTTA GCACCTTACC AAAGAGGGGG AAAAATCGGT CTATTTGGAG 400

GTGCAGGTGT TGGTAAAACA GTTCTTATCC AAGAGCTTAT AAACAACATA 450

GCTAAAGAGC ACGGTGGACT TTCTGTATTC ACAGGAGTTG GAGAAAGATC 500

AAGAGAAGGT AATGACCTTT ACTATGAAAT GATGGAATCA GGAGTTATAA 550

AAAATACAGC ATTAGTATTT GGACAAATGA ACGAACCACC TGGAGCAAGA 600

ATGAGAGTTG CTTTAACAGG ACTTACTATG GCTGAGTACT TCAGAGACCA 650

AGGTCAAGAC GTGTTATTAT TCATAGATAA CATATTCAGA TTCTCACAAG 700

CTGGATCAGA GGTTTCAGCT TTATTAGGAA GAATACCATC AGCTGTTGGT 750

TACCAACCAA CTCTTGCTAC AGAGATGGGA GCTCTTCAAG AGAGAATCAC 800

ATCAACTACC CATGGATCAA TTACATCAG 829



2) INFORMATION FOR SEQ ID NO: 271 

WO 01/23604 PCT/CA00/01150

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271 
TTGTCCACGT TGGATRTCTT CA 



2) INFORMATION FOR SEQ ID NO: 272 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 818 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272 

CCCGCGTGGC GAGCTGCCGG CACTGTACAA CGCGTTGACT GTCGAGGTCA 50 

CCCTCGAGGC AGTCGCTAAG ACCATTACCC TTGAGGTTGC CCAGCACTTG 100 

GGCGACAACC TCGTTCGCGC CGTGTCCATG GCCCCTACCG ACGGCCTCGT 150 

CCGTGGTGCT GTTGTGACCG ACTCGGGCAA GCCAATCTCC GTGCCAGTTG 200 

GCGACGTTGT TAAAGGCCAC GTTTTCAACG CACTGGGCGA TTGCTTGGAT 250 

GAGCCAGGTC TCGGCCGCGA TGGTGAGCAG TGGGGAATTC ACCGCGATCC 300

ACCACCATTC GATCAGCTCG AAGGTAAGAC CGAAATCCTC GAGACCGGTA 350 

TTAAGGTCAT CGACTTGCTC ACCCCTTACG TTAAGGGCGG CAAGATTGGT 400 

CTGTTCGGTG GTGCAGGTGT GGGTAAGACC GTGCTCATCC AGGAGATGAT 450 

CACTCGTATT GCTCGCGAGT TCTCCGGTAC CTCCGTCTTC GCTGGCGTTG 500 

GTGAGCGTAC CCGTGAGGGC ACCGACCTCT TCCTCGAAAT GGAAGAAATG 550 

GGCGTTCTTC AGGACACCGC TCTCGTGTTC GGCCAGATGG ACGAGCCACC 600 

AGGAGTCCGT ATGCGCGTTG CTCTGTCCGG TCTGACCATG GCGGAGTACT 650 

TCCGCGATGT TCAGCACCAG GACGTGCTTC TGTTCATCGA TAACATTTTC 700 

CGTTTCACCC AGGCCGGTTC CGAGGTTTCG ACCCTTCTTG GTCGTATGCC 750 

TTCCGCCGTG GGTTACCAGC CAACCTTGGC TGACGAGATG GGTGTTCTCC 800 

AGGAGCGTAT TACCTCTA 818 



2) INFORMATION FOR SEQ ID NO: 273 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Corynebacterium pseudodiphtheriticum

(B) STRAIN: ATCC 10700

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273



CGATGCCTGC TCTGTACAAC GCGCTGACTG TCGAGGTCAC CCTCGAGGCA 50

GTCGCCAAGA CCATCACGCT TGAGGTTGCA CAGCACCTCG GCGATAACCT 100

GATCCGGACC ATTGCGTTGG CACCTACGGA CGGTCTCGTC CGTGGCGCTG 150

AGGTTATCGA CACTGGTAAG CCAATTACTG TTCCCGTCGG CGATGCCGTC 200

AAAGGACACG TCTTCAATGC GCTCGGTGAG TGTTTGGACG AACCAGGATT 250

GGGCCGCGAC GGCGAACAGT GGGGAATCCA CCGCGATCCG CCACCATTCG 300

ATGCGCTGGA GGGCAAAACC GAGATTCTGG AGACTGGAAT CAAGGTTATC 350

GACCTCCTTA CCCCTTACGT TAAGGGTGGC AAAATTGGTC TGTTCGGTGG 400

CGCCGGCGTC GGCAAGACCG TTCTTATCCA GGAAATGATC ACTCGTATCG 450

CTCGTAACTT CTCCGGTACT TCCGTGTTCG CCGGCGTCGG TGAGCGTACC 500

CGTGAGGGTA CTGACCTGTT CCTGGAAATG GAAGAGATGG GCGTGTTGCA 550

AGACACCGCC CTTGTCTTCG GTCAAATGGA CGAACCACCA GGGGTTCGTA 600

TGCGCGTGGC CTTGTCTGGT CTAACCATGG CTGAATATTT CCGCGACGTT 650

CAAAACCAGG ACGTTTTGTT GTTCATTGAC AACATCTTCC GTTTTACTCA 700

GGCAGGTTCC GAGGTTTCCA CGCTGTTGGG CCGTATGCCT TCCGCCGTGG 750

GTTATCAGCC AACATTGGCT GATGAGATGG GTGTTTTGCA GGAACGGATT 800

ACCTCTACAC GTGGTAAGTC AATTACTTCC CTG 833






2) INFORMATION FOR SEQ ID NO: 274 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 417 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Corynebacterium ulcerans

(B) STRAIN: NCTC 8665

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274



CCGTGCTCAT CCAGGAGATG ATCACCCGTG TGGCCCGCAA CTTCGGCGGC 50

ACCTCTGTGT TCGCCGGCGT CGGCGAGCGC ACCCGTGAGG GCAACGACCT 100

CTGGGTCGAG ATGGACGAGG CCGACGTGCT CAAGGACACC GCCCTGGTGT 150

TCGGCCAGAT GGACGAGCCG CCGGGAACCC GTCTGCGCGT GGCCCTGTCC 200

GCGCTGACCA TGGCGGAGTA CTTCCGCGAT GTGCAGAACC AGGACGTGCT 250

GCTGTTCATC GACAACATCT TCCGCTTCTC CCAGGCCGGC TCCGAGGTCT 300

CCACCCTGCT GGGCCGCATG CCCTCCGCGG TGGGCTACCA GCCGAACCTG 350

GCGGACGAGA TGGGTGTGCT GCAGGAGCGC ATCACCTCGA CTCGCGGCCA 400

CTCCATCACC TCGATGC 417



2) INFORMATION FOR SEQ ID NO: 275

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 835 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Corynebacterium urealyticum

(B) STRAIN: ATCC 43042

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275

GGGCAGCAGC CAGCACTATT CAACGCGCTG CACGTCGAGG TTGACCTCGA 50 

GGCAGTTGCG AAGACCATTA CCCTGGAGGT CGCACAGCAC CTGGGTGACA 100 

ACCTGGTGCG CACCGTCTCC ATGGCCCCGA CCGACGGCCT GGTCCGCGGT 150 

GCAGAGGTCA AGGACACCGG TAAGCCGATC TCTGTGCCAG TCGGCGATGT 200 

TGTCAAGGGG CACGTCTTCA ACGCCCTGGG CGACTGCCTG GATGAGCCAG 250

GTCTCGGCCG CGACGGCGAG CAGTGGGGCA TCCACCGCGA GCCACCGGCA 300 

TTCGACGAGC TCGAGGGTAA GACCGAGATC CTGGAGACCG GCGTTAAGGT 350 

CATCGACCTG CTGACCCCTT ACGTCAAGGG CGGCAAGATT GGCCTCTTCG 400

GTGGTGCAGG TGTGGGTAAG ACCGTCCTGA TTCAGGAGAT GATTACCCGT 450 

ATCGCCCGCG AGTTCTCCGG TACCTCCGTG TTCGCCGGCG TCGGCGAGCG 500

TACCCGTGAG GGTACGGACC TCTTCCTCGA GATGGAGGAG ATGGGCGTGC 550 

TCCAGGACAC CGCGCTGGTG TTCGGTCAGA TGGATGAGCC GCCGGGAGTC 600 

CGTATGCGCG TGGCTCTGTC CGGTCTGACC ATGGCGGAGT ACTTCCGCGA 650 

TGTTCAGGGC CAGGACGTGC TGCTGTTCAT CGACAACATC TTCCGTTTCA 700 

CCCAGGCAGG TTCTGAGGTC TCCACGCTGC TCGGCCGCAT GCCGTCCGCA 750

GTGGGTTACC AGCCGACCCT GGCTGACGAG ATGGGTGTTC TGCAGGAGCG 800 

CATTACCTCC ACGAAGGGTA AGTCCATTAC CTCCC 835 






2) INFORMATION FOR SEQ ID NO: 276 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 818 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Coxiella burnetii

(B) STRAIN: Nine Mile phase II

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276



CCCGTCACGC TGTCCCGAAA GTGTACGATG CTTTGCAGGT GGATGAAAAT 50

AATTTAACGC TCGAAGTCCA ACAGCAACTC GGGGACGGTG TCGTGCGCAC 100

AATTGCCATG GGCAGCACTG AGGGCTTAAA ACGCGATATC GCCGTAAAAA 150

ATACGGAAAA ACCGATTGAA GTTCCCGTAG GAAAAGAAAC TTTAGGTCGT 200

ATCATGAACG TGCTGGGTGA GCCGATCGAT GAGTTAGGTC CCATTAATTC 250

AAAAGAAAAA CTCCCTATTC ATCGTCCTGC GCCGAGCTTT ATTGAGCAAT 300

CTGGCGCTAC CGAATTATTA GAAACCGGTA TTAAAGTGGT CGATTTGCTT 350

TGCCCCTTTG CTAAGGGAGG CAAAGTGGGT CTTTTTGGAG GCGCGGGCGT 400

TGGAAAAACG GTTAATATGA TGGAATTAAT CCGTAATATC GCCATTGAAC 450

ACAGCGGTTA TTCTGTTTTT GCGGGTGTGG GAGAAAGAAC GCGAGAAGGC 500

AATGATTTTT ATCATGAAAT GAAAGAATCC AATGTCTTGG ATAAAGTGGC 550

GTTGGTGTAC GGACAAATGA ACGAGCCGCC AGGGAACCGC TTGCGGGTGG 600

GTTTGACGGG GCTTACGCTG GCGGAAGCCT TCCGTGACGA AGGACGCGAC 650

GTTCTGTTAT TTATCGATAA TATCTTTCGT TACACTTTGG CAGGGGTTGA 700

AGTCTCTGCC CTCCTCGGTC GGATGCCATC GGCTGTGGGT TATCAGCCGA 750

CGTTGGCCGA AGAGATGGGG GCCCTGCAAG AACGCATTAC TTCCACTAAA 800

AAAGGGTCCA TTACGTCG 818



2) INFORMATION FOR SEQ ID NO: 277 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwardsiella hoshinae 

(B) STRAIN: ATCC 33379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277 

TTCCCGCAGG ACGCCGTGCC GAAGGTGTAT AACGCACTGG AAGTAAAAGG 50 

CGGTGCCACG AAACTGGTAC TGGAAGTGCA GCAGCAGCTG GGTGGCGGCG 100 

TAGTTCGCTG CATCGCGATG GGCTCTTCCG ACGGTCTGCG CCGTGGGCTA 150 

GAGGTTGAAG ACCAAGACCA TCCGATCGAG GTTCCTGTTG GCAAGGCGAC 200 

TCTGGGCCGT ATCATGAACG TACTGGGTGA TCCGGTCGAC ATGAAGGGCG 250 

AGATCGGTGA AGAAGAGCGT TGGGCTATCC ATCGTGCTGC ACCGAGCTAT 300 

GAAGATCTGT CTAACTCTCA GGAACTGCTG GAGACCGGCA TTAAGGTTAT 350 

CGACCTGATT TGCCCGTTCG CTAAAGGCGG TAAAGTGGGC CTGTTCGGTG 400

GGGCCGGTGT GGGTAAGACC GTTAACATGA TGGAGCTTAT CCGTAACATC 450 

GCTATCGAGC ACTCCGGTTA CTCAGTCTTC GCCGGTGTGG GTGAGCGTAC 500 

CCGTGAGGGT AACGACTTCT ACCACGAGAT GACCGATTCC AACGTATTGG 550 

ATAAAGTTTC TCTGGTGTAT GGTCAGATGA ACGAGCCACC GGGAAACCGT 600 

CTGCGCGTGG CGCTGACCGG TCTGACCATG GCGGAGAAAT TCCGTGATGA 650 

AGGTCGTGAT GTACTGTTGT TCATCGATAA CATCTACCGT TATACCTTGG 700 

CCGGTACTGA AGTCTCCGCT CTGCTGGGCC GTATGCCGTC GGCGGTAGGT 750 

TATCAGCCGA CTCTGGCGGA GGAAATGGGG GTGCTGCAAG AGCGTATTAC 800 

CTCCACTAAG ACCGGGTCCA TCACCTCTG 829 



2) INFORMATION FOR SEQ ID NO: 278 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwardsiella tarda 

(B) STRAIN: ATCC 15947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278 

GCCGTGCCGA AGGTGTATAA CGCACTGGAA GTAAAAGGCG GTGCCACGAA 50

ACTGGTACTG GAAGTGCAGC AGCAGCTGGG TGGCGGCGTC GTTCGCTGCA 100

TCGCGATGGG CTCCTCCGAC GGTCTGCGCC GTGGGCTGGT TKJTTGKAGKCT 150

CAAGACCATC CGATCGAGGT TCCGGTCGGT AAGGCGACCC TGGGCCGTAT 200

CATGAACGTA CTGGGTGATC CGGTCGACAT GAAGGGCGAG ATCGGCGAAG 250

AAGAGCGTTG GGCTATCCAC CGCGCGGCGC CGAGCTATGA AGATCTGTCC 300

AACTCTCAGG AGCTGCTGGA GACCGGCATC AAGGTTATCG ACCTGATTTG 350

CCCGTTCGCC AAAGGCGGTA AAGTTGGCCT GTTCGGTGGT GCCGGTGTGG 400

GTAAGACCGT TAACATGATG GAGCTTATCC GTAACATCGC TATCGAGCAC 450

TCCGGTTACT CCGTATTTGC CGGTGTAGGC GAGCGTACCC GTGAGGGTAA 500

CGACTTCTAC CACGAGATGA CCGACTCCAA CGTATTGGAT AAAGTTTCTC 550

TGGTATACGG CCAGATGAAT GAGCCGCCGG GAAACCGTCT GCGTGTGGCG 600

TTGACCGGTC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGCGATGT 650

GTTGTTGTTC ATCGATAACA TTTATCGTTA TACCTTGGCT GGTACCGAAG 700

TTTCTGCTCT GCTGGGTCGT ATGCCGTCGG CGGTAGGTTA TCAGCCGACC 750

CTGGCGGAAG AGATGGGTGT GTTGCAAGAG CGTATCACCT CAACGAAGAC 800

GGGCTCTAT 809



2) INFORMATION FOR SEQ ID NO: 279 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 840 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Eikenella corrodens

(B) STRAIN: ATCC 23834 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279

TTTCCGCGTA CCGCCATTCC GCGTGTTTAC GATGCACTCA AACTGGTTGA 50

TACTGATTTG ACGCTGGAAG TACAGCAGCA GCTTGGTGAC GGCGTTGTCC 100 

GTACCATTGC GATGGGTAGT ACAGACGGTT TGAAACGTGG CTTGGCTGTG 150 

CAAAATACTG GTGCACCGAT TACTGTGCCG GTGGGGAAAG CCACCCTAGG 200 

CCGCATCATG GACGTTCTGG GCAACCCCGT GGACGAACAA GGTCCGATCG 250 

GTTCTGACCA AACCCGTGCT ATTCATCAAT TTGCACCTAA GTTCGACGAA 300

CTCTCCAGCA CTACCGAATT GTTGGAAACA GGCATTAAAG TGATCGATTT 350

GCTTTGTCCG TTTGCTAAAG GTGGTAAAGT GGGTCTGTTT GGCGGTGCCG 400

GTGTGGGCAA AACCGTGAAC ATGATGGAGC TGATTAACAA CATTGCCAAA 450

GCGCACAGTG GTCTTTCCGT ATTCGCCGGT GTGGGTGAGC GTACTCGTGA 500 

AGGTAACGAC TTCTACCACG AAATGAAAGA CTCCAACGTG TTGGATAAAG 550

TGGCAATGGT GTATGGCCAG ATGAACGAGC CGCCTGGTAA CCGCTTGCGT 600 

GTTGCTCTAA CTGGTTTGTC GATGGCTGAA TACTTCCGTG ACGAAAAAGA 650 

CGAAAACGGC AAAGGCCGTG ACGTATTATT CTTTGTGGAT AATATCTACC 700 

GCTATACTCT GGCGGGTACC GAAGTGTCGG CTCTGCTTGG CCGTATGCCT 750 

TCTGCTGTGG GTTATCAGCC AACTTTGGCT GAAGAAATGG GTCGTTTGCA 800

GGAGCGTATT ACCTCCACCC AAACTGGTTC CATTACCTCT 840 



2) INFORMATION FOR SEQ ID NO: 280

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterobacter agglomerans 

(B) STRAIN: ATCC 27989 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280 

GCCGTACCAC GAGTGTACGA TGCACTTGAG GTAAAGAATG GTGAAGAGCG 50 

TCTGGTGCTG GAAGTTCAGC AACAGCTCGG CGGTGGCGTT GTACGTACCA 100 

TCGCAATGGG TTCTTCTGAT GGTCTGCGTC GTGGTCTGGA AGTAACGGAC 150 

CTGGCTCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

CATGAACGTA CTGGGCGAAC CAGTAGACAT GAAAGGCGAC ATCGGTGAAG 250

AAGAGCGTTG GGCGATCCAC CGTGCAGCAC CGTCCTACGA AGAGTTGTCA 300 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTGGGTCT GTTCGGTGGT GCGGGTGTAG 400

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAC AAAGTATCCC 550 

TGGTGTATGG TCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACCATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

ACTGTTGTTC GTTGACAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCTG CGGTAGGTTA TCAGCCGACG 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGG 803 



2) INFORMATION FOR SEQ ID NO: 281 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter amnigenus 

(B) STRAIN: ATCC 33072 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281 

CGAATTCCCT CAGGATGCCG TACCGCGCGT GTACGATGCT CTTGAGGTAC 50 

AGAATGGTAA CGAGAGTCTG GTGCTGGAAG TTCAGCAGCA GCTCGGTGGT 100 

GGTATCGTAC GTACTATCGC CATGGGTTCT TCCGACGGTC TGCGTCGTGG 150 

TCTGGCTGTT AAAGATCTCG AACACCCGAT CGAAGTCCCG GTAGGTAAAG 200 

CAACACTGGG TCGTATCATG AACGTTTTGG GTCAACCAAT CGACATGAAA 250 
GGCGACATCG GTGAAGAAGA CCGTTGGGCA ATCCACCGTG CAGCACCTTC 300

CTATGAAGAG CTGTCTAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 350 

TTATCGACCT GATGTGTCCG TTCGCTAAGG GCGGTAAAGT TGGTCTGTTC 400 

GGCGGTGCGG GCGTGGGTAA AACTGTAAAC ATGATGGAGC TGATCCGTAA 450

CATCGCGATC GAGCACTCCG GTTACTCCGT GTTTGCAGGC GTGGGTGAGC 500 

GTACTCGTGA GGGTAACGAC TTCTACCACG AAATGACCGA TTCCAACGTT 550 

CTGGATAAAG TATCCCTGGT TTATGGCCAG ATGAACGAGC CACCAGGAAA 600 

CCGTCTGCGC GTTGCGCTGA CCGGTCTGAC TATGGCTGAG AAGTTCCGTG 650 

ACGAAGGTCG TGACGTACTG CTGTTCGTAG ATAACATCTA CCGTTACACC 700 

CTGGCCGGTA CTGAAGTATC TGCGCTGCTG GGCCGTATGC CTTCAGCGGT 750

AGGTTACCAG CCGACCCTGG CGGAAGAGAT GGGCGTTCTG CAGGAACGTA 800

TCACTTCTAC CAAAACCGGT TCTATCACCT CCG 833



2) INFORMATION FOR SEQ ID NO: 282 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter asburiae

(B) STRAIN: ATCC 35953 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282

GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTCAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200

CATGAACGTA TTGGGTCAAC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 300 

AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTTG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CCGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550 

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACGATGGC TGAGAAGTTC CGTGATGAAG GCCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA CCAGCCTACG 750 

CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800 

CGGTTCTATC 810 



2) INFORMATION FOR SEQ ID NO: 283 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cancerogenus 

(B) STRAIN: ATCC 35317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTACAGAATG GTAACGAGAG 50

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACTA 100

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTAAAAGAC 150

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200

CATGAACGTA TTGGGTCAAC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250

AAGAGCGTTG GGCTATCCAC CGCGCAGCAC CTTCCTACGA AGAGCTGTCC 300

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450

TCCGGTTATT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAT AAAGTATCCC 550

TGGTGTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600

CTGACCGGCC TGACCATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650

ACTGCTGTTC GTTGATAACA TCTACCGTTA CACCCTGGCC GGTACCGAAG 700

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCTACG 750

CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CAACCAAAAC 800

CGGTTCTATC A 811



2) INFORMATION FOR SEQ ID NO: 284 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Enterobacter cloacae

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284 

GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 50

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

CATGAACGTA TTGGGTCAGC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 300

AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350

TCCGTTTGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450

TCCGGTTACT CCGTATTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACTGGTC TGACGATGGC TGAGAAGTTC CGTGACGAAG GCCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCTACG 750 

CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800

CGGTTCTATC ACTTCCG 817 



2) INFORMATION FOR SEQ ID NO: 285

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 766 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter gergoviae 

(B) STRAIN: ATCC 33028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285 

AATGAGAGCC TGGTGCTGGA AGTTCAGCAG CAGCTCGGCG GCGGTATCGT 50 

GCGTACCATC GCAATGGGTT CTTCTGACGG TCTGCGTCGC GGTCTGGAAG 100 

TTAAAGATCT CGAACATCCG ATCGAAGTCC CGGTAGGTAA AGCGACCCTC 150 

GGCCGTATTA TGAACGTGCT GGGTCAGCCG GTTGATATGA AAGGCGATAT 200 

CGGCGAAGAA GAGCGTTGGG CGATCCACCG CGCTGCGCCG TCCTATGAAG 250

AGCTCTCCAG CTCTCAGGAA CTGCTGGAAA CCGGTATCAA GGTAATGGAC 300 

CTGATTTGCC CGTTCGCGAA GGGCGGTAAA GTCGGTCTGT TCGGCGGTGC 350 

GGGCGTTGGT AAAACCGTAA ACATGATGGA GCTGATCCGT AACATCGCGA 400

TCGAGCACTC CGGCTACTCC GTGTTTGCGG GCGTGGGTGA ACGTACTCGT 450 

GAGGGTAACG ACTTCTACCA CGAAATGACC GACTCCAACG TTATCGACAA 500 

AGTATCCCTG GTGTACGGCC AGATGAACGA GCCGCCGGGA AACCGTCTGC 550 

GCGTGGCGCT GACCGGTCTG ACCATGGCTG AGAAATTCCG TGACGAAGGT 600 

CGTGACGTTC TGCTGTTCGT CGATAACATC TACCGCTATA CCCTCGCCGG 650 

TACTGAAGTA TCCGCACTGC TGGGCCGTAT GCCTTCTGCA GTAGGTTACC 700 

AGCCGACGCT GGCGGAAGAG ATGGGTGTTC TGCAGGAACG TATCACCTCC 750 

ACCAAAACCG GTTCTA 766 



2) INFORMATION FOR SEQ ID NO: 286 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter hormaechei 

(B) STRAIN: ATCC 49162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286 

GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTGAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 200

CATGAACGTA TTGGGTCAGC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 300 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTTGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450

TCCGGTTACT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550 

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACTGGCC TGACGATGGC TGAGAAGTTC CGTGACGAAG GCCGTGACGT 650 

TCTGCTGTTC GTCGATAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCAACG 750

CTTGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CGACCAAAAC 800

CGGTT 805



2) INFORMATION FOR SEQ ID NO: 287 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter sakazakii

(B) STRAIN: ATCC 29544 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287 



TACGACGCCC TTGAGGTAAC GAATGGTAAT GAGCGTCTGG TGCTGGAAGT 50

CCAGCAGCAG CTCGGCGGCG GTATCGTACG TACCATCGCG ATGGGCTCTT 100

CCGACGGTCT GCGTCGCGGT CTGCCTGTTG CAGACCTTGA GCACCCGATC 150

GAAGTGCCGG TAGGTAAAGC GACGCTGGGT CGTATCATGA ACGTCCTGGG 200

TCAGCCTATC GACATGAAAG GCGACATCGG CGAAGAAGAG CGTTGGGCGA 250

TTCATCGCGC GGCGCCGTCC TATGAAGAGC TGTCCAGCTC TCAGGAACTG 300

CTGGAAACCG GCATCAAAGT TATCGACCTG ATGTGTCCGT TCGCGAAGGG 350

CGGTAAAGTC GGTCTGTTCG GTGGTGCAGG TGTAGGTAAA ACCGTAAACA 400

TGATGGAGCT TATTCGTAAC ATCGCGATTG AGCACTCCGG TTACTCCGTG 450

TTTGCGGGCG TGGGCGAACG TACCCGTGAA GGTAACGACT TCTACCACGA 500

AATGACCGAC TCCAACGTAC TGGATAAAGT ATCCCTGGTG TACGGCCAGA 550

TGAACGAGCC GCCGGGAAAC CGTCTGCGCG TTGCGCTGAC CGGCCTGACC 600

ATGGCTGAGA AATTCCGTGA CGAAGGTCGT GACGTTCTGC TGTTCGTCGA 650

CAACATCTAC CGTTACACCC TGGCCGGTAC TGAAGTATCC GCACTGCTGG 700

GCCGTATGCC TTCAGCGGTA GGTTATCAGC CGACCCTGGC GGAAGAGATG 750

GGTGTTCTGC AGGAGCGTAT CACCTCCACC AAAACCGGTT C 791

2) INFORMATION FOR SEQ ID NO: 288 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus avium

(B) STRAIN: ATCC 14025

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288

TTTCTTTAGA TCAATCCTTA CCAGACATCA ACAATGCGTT GATTGTTTAC 50 

AAAAAAGATA AAACAAAAGT TGTTCTTGAA GTTGCTTTGG AACTTGGTGA 100 

TGGTGTTATC CGCACAATCG CTATGGAGGC TACTGATGGA TTGCAACGTG 150 

GAATGGAAGT TGTCGATACT GGCAAATCAA TCTCCGTTCC TGTAGGTAAA 200

GATACTCTAG GTCGTGTGTT TAACGTATTA GGTGAAACGA TTGATAAAGA 250

AGCACCTTTT CCAGAAGATG CAGAAAGAAG CGGCATTCAT AAAAAGGCGC 300

CTGCTTTTGA AGACCTTAGT ACAAGTAACG AGATTTTGGA AACAGGGATC 350

AAGGTTATCG ACTTATTAGC CCCTTATTTA AAAGGTGGGA AAGTCGGACT 400

ATTCGGTGGT GCCGGTGTTG GTAAAACCGT TTTGATCCAA GAATTAATTC 450

ATAATATCGC CCAAGAACAC GGTGGTATTT CAGTGTTTAC CGGTGTTGGG 500

GAACGTACTC GTGAAGGGAA CGACCTTTAT TATGAAATGA AAGACTCTGG 550

CGTTATTGAG AAAACAGCCA TGGTGTTCGG ACAAATGAAC GAGCCGCCTG 600

GTGCACGTAT GCGTGTTGCC TTGACTGGTT TGACATTAGC TGAATATTTC 650

CGTGATGAAG AAGGACAAGA TGTGTTGCTA TTTATTGACA ACATCTTCCG 700

CTTTACTCAA GCCGGATCAG AAGTTTCTGC CTTATTAGGA CGTATGCCAT 750

CAGCCGTTGG GTATCAACCA ACTTTGGCAA CTGAAATGGG GCAATTACAA 800

GAACGAATCA CTTCAACCAA AAAAGGTTCG ATCACTTCA 839






2) INFORMATION FOR SEQ ID NO: 289 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 847 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289 



TTCTCTCTAG ACCAATCATT ACCAGATATC AACAATGCGT TGATTGTTTA 50

CAAAAAAGAT GAGCAGAAAA CAAAAGTTGT GTTAGAAGCT GCCTTAGAAC 100

TTGGCGACGG CGTTATCCGT ACGATTGCCA TGGAATCAAC AGATGGCTTA 150

CAACGAGGAA TGGAAGTAAT CGATACAGGC GCCTCCATTT CTGTTCCAGT 200

TGGGACAGAA ACCTTAGGAC GGGTGTTTAA TGTCTTAGGG GACACCATCG 250

ATTTAGAAGC GCCGTTCCCT GAAGAAGCAC CCCGCAGTGG GATTCACAAA 300

AAAGCACCTG ACTTTGATGA ATTGTCAACA AGTACGGAGA TCCTTGAAAC 350

TGGGATCAAA GTTATCGATT TGTTAGCCCC TTATTTAAAA GGGGGGAAAG 400

TTGGACTTTT CGGTGGTGCC GGTGTTGGTA AAACCGTCTT GATCCAAGAA 450

TTGATCCACA ACATCGCCCA AGAGCATGGT GGGATCTCTG TCTTCACAGG 500

TGTTGGTGAA CGGACACGTG AAGGAAATGA CCTTTATAAT GAAATGAAAG 550

AATCTGGCGT TATCGAAAAA ACAGCCATGG TGTTTGGACA AATGAACGAA 600

CCACCAGGTG CTCGGATGCG GGTAGCCTTG ACTGGTTTGA CATTAGCCGA 650

GTACTTCCGT GATGTGGAAG GACAAGACGT GCTCTTGTTT ATCGATAATA 700

TCTTCCGCTT CACTCAAGCA GGTTCTGAAG TATCTGCCTT ACTAGGTCGG 750

ATGCCGTCTG CCGTTGGGTA TCAGCCAACA TTAGCAACTG AGATGGGGCA 800

ATTACAAGAA CGGATCACAT CGACGAAGAA AGGTTCCGTT ACGTCTA 847



2) INFORMATION FOR SEQ ID NO: 290 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus durans 
(B) STRAIN: ATCC 19432

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290 

TTTCTTTAGA CCAATCCTTA CCAGATATCA ACAACGCTTT AGTTGTTTAT 50 

AAAAATGATG AGAAGAAATC GAAAGTTGTT CTTGAAACAG CGCTAGAATT 100

AGGTGACGGT GTCATCCGTA CAATCGCGAT GGAATCAACA GATGGTTTAC 150 

AACGCGGAAT GGAAGTCATT GATACAGAAA AAGCAATTTC TGTACCAGTG 200 

GGTAAAGAAA CGTTAGGTCG TGTATTCAAT GTATTAGGAG ATACGATCGA 250 

TTTATCTGCA CCTTTCCCAG AAGATGCAAA ACGTAGCGAA ATCCATAAAA 300 

AAGCACCAAA CTTTGATGAG TTAAGTACAA GTACTGAGAT CCTTGAAACT 350

GGGATCAAAG TTATTGACTT GCTTGCTCCT TACTTAAAAG GTGGGAAAGT 400 

TGGATTATTC GGTGGTGCCG GTGTAGGTAA AACTGTATTG ATCCAAGAAT 450 

TGATCCATAA TATCGCTCAA GAACACGGTG GTATTTCTGT ATTTACTGGT 500 

GTTGGTGAAC GTACACGTGA AGGTAATGAC CTTTATTATG AAATGAAAGA 550 

TTCAGGAGTT ATTGAAAAAA CAGCCATGGT GTTTGGTCAA ATGAACGAAC 600

CACCAGGTGC ACGTATGCGT GTTGCCTTGA CTGGTTTGAC GATTGCTGAA 650 

TACTTCCGTG ATGTTGAAGG GCAAGACGTG CTATTGTTTA TTGATAATAT 700 

TTTCCGTTTC ACTCAAGCCG GTTCAGAAGT TTCTGCCCTA TTAGGTCGTA 750 

TGCCTTCTGC CGTTGGGTAC CAACCAACGC TAGCAACAGA AATGGGTCAA 800 

TTACAAGAAC GGATCACTTC AACGAAAAAA GGTTCAATCA CTTCA 845



2) INFORMATION FOR SEQ ID NO: 291 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Enterococcus faecalis

(B) STRAIN: ATCC 29212

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291

TTAGATCAAT CCTTACCCGA TATTAACAAC GCTTTAGTCG TTTATAAAAA 50

TGGCGAAGCA AAACAAAAAG TAGTACTTGA AGTCGCTTTA GAACTAGGTG 100 

ATGGAGTGAT TCGTTCTATC GCCATGGAAT CGACAGATGG TTTACAACGT 150

GGAATGGAAG TTATCGATAC AGGAAAATCA ATTTCAGTTC CTGTTGGTAA 200 

AGATACATTA GGTCGTGTGT TTAACGTTTT AGGAGACACA ATTGACTTAG 250 

AAGCGCCATT CCCTGCAGAT GCTGAACGTA GTGGGATTCA TAAAAAAGCG 300

CCAGCATTTG ATGAATTAAG TACCAGTAAT GAAATTTTAG AAACAGGGAT 350

TAAAGTTATT GACTTATTAG CACCTTATCT AAAAGGTGGT AAAGTCGGAC 400

TTTTCGGTGG TGCCGGTGTT GGTAAAACCG TCTTAATTCA AGAATTAATT 450 

CATAATATTG CCCAAGAACA TGGAGGGATT TCCGTCTTTA CTGGTGTTGG 500 

TGAACGGACA CGTGAAGGGA ACGATCTGTA CTATGAAATG AAAGATTCAG 550

GCGTTATTGA AAAAACAGCC ATGGTTTTTG GTCAAATGAA CGAACCGCCA 600 

GGTGCACGGA TGCGTGTGGC CTTAACTGGG TTAACGATTG CTGAATATTT 650 

CCGTGATGTG GAAGGACAAG ACGTGCTATT ATTTATTGAT AACATTTTCC 700 

GTTTCACCCA AGCCGGTTCA GAAGTTTCTG CCCTTTTAGG TCGGATGCCG 750 

TCAGCCGTTG GTTACCAACC AACCTTAGCG ACTGAAATGG GACAATTACA 800

AGAACGGATT ACTTCAACGA AAAAAGGATC AATTACCTCT 840



2) INFORMATION FOR SEQ ID NO: 292

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 831 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292 

TTTCTTTAGA CCAATCCTTA CCAGATATCA ACAACGCATT AGTTGTTTAT 50 

AAAAATGACG AAAATAAATC AAAAGTTGTT CTTGAAGCAG CGTTAGAATT 100 

AGGGGACGGA GTGATCCGGA CCATTGCGAT GGAATCAACA GATGGTTTAC 150 

AAAGAGGAAT GGAAGTCATT GATACAGGCA AAGCAATCTC TGTTCCTGTA 200 

GGTAAAGAAA CATTAGGTCG CGTATTCAAC GTACTAGGAG ATACGATCGA 250 

TTTAGAAACA CCTTTCCCAG AAGATGCGGA AAGAAGCGAA ATTCATAAAA 300 

AAGCACCAGC CTTTGACGAA TTAAGTACAA GTACAGAAAT TTTGGAAACA 350 

GGGATCAAAG TTATCGATTT GCTTGCCCCA TATTTAAAAG GTGGGAAAGT 400

CGGACTATTC GGTGGTGCCG GTGTTGGTAA AACCGTACTG ATCCAAGAAC 450 

TGATCCATAA TATCGCCCAA GAACATGGTG GTATTTCTGT ATTTACCGGT 500 

GTAGGTGAAC GTACTCGTGA AGGTAATGAC TTGTATTATG AAATGAAAGA 550 

TTCAGGAGTT ATCGAAAAAA CAGCCATGGT GTTCGGACAA ATGAACGAAC 600 

CACCAGGTGC ACGTATGCGT GTTGCTTTGA CTGGTTTGAC GATTGCGGAA 650 

TATTTCCGTG ATGTAGAAGG TCAAGATGTA CTGTTGTTTA TCGACAACAT 700 

TTTCCGTTTC ACTCAAGCTG GATCTGAAGT ATCAGCCTTG TTAGGACGGA 750 

TGCCTTCTGC GGTTGGTTAT CAACCAACAT TGGCAACAGA AATGGGTCAA 800

TTGCAAGAAC GTATCACATC TACGAAAAAA G 831 



2) INFORMATION FOR SEQ ID NO: 293 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 826 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 49573

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293

CCTTACCAGA TATCAACAAT GCGTTGATCG TTTACAAAAA AGACGAGAAA 50 

AAAACAAAAG TAGTATTGGA AGCCGCTTTA GAACTAGGGG ATGGTGTGAT 100 

CCGCACCATC GCAATGGAAT CTACAGACGG TTTGCAACGA GGAATGGAAG 150 

TCATCGATAC CGGTGCCTCA ATCTCTGTCC CTGTAGGAAC AGATACTCTA 200

GGCCGAGTAT TTAATGTACT AGGCGATACT ATCGACTTGG AAGCACCAT^ 250

CCCAGAAGAT GCCAAACGTA GTGGCATCCA CAAAAAAGCC CCAGATTTCG 300

ATGAATTGTC AACAAGTACA GAAATCCTTG AAACTGGGAT CAAAGTTATC 350

GATTTATTAG CTCCTTACTT AAAAGGTGGT AAAGTCGGCT TGTTCGGTGG 400

TGCCGGTGTT GGTAAAACCG TATTGATTCA AGAATTGATT CACAATATCG 450

CTCAAGAGCA TGGGGGAATT TCAGTATTTA CCGGTGTTGG CGAACGGACG 500

CGTGAAGGTA ATGACTTGTA TTATGAAATG AAAGAATCAG GCGTTATCGA 550

AAAGACAGCC ATGGTTTTCG GTCAAATGAA TGAACCACCA GGTGCCCGGA 600

TGCGGGTTGC TTTGACTGGT TTGACCATTG CTGAGTATTT CCGTGACGTT 650

GAAGGACAAG ATGTGCTCTT GTTTATCGAT AATATTTTCC GTTTCACACA 700

AGCGGGTTCT GAAGTATCTG CCTTGTTAGG CCGGATGCCA TCAGCCGTTG 750

GTTATCAACC AACTCTAGCA ACTGAAATGG GTCAATTACA AGAACGAATC 800

ACTTCTACGA AAAAAGGATC TGTAAC 826






2) INFORMATION FOR SEQ ID NO: 294 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus saccharolyticus 

(B) STRAIN: ATCC 43076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294 






TTTCTTTGGA CCAATCCTTA CCAGACATCA ACAATGCGTT GGTGGTCTAT 50

AAAAAGAATG ATGAAAAAAC AAAAGTGGTA CTTGAAACAG CTTTAGAACT 100

TGGTGATGGT GTCGTACGTA CGATTGCTAT GTCGTCAACA GACGGTTTGC 150

AACGTGGGAT GGAAGTCATC GATACAGGAG CATCAATTTC TGTTCCTGTT 200

GGGAAAGAGA CATTAGGACG TGTATTTAAC GTTTTAGGGG AGACTATCGA 250

CTTAGATGGT CCATTCCCAG AAGAAGTAGC ACGAGATGGT ATTCATAAAA 300

AGGCACCTGA TTTTGATGAA TTAAGTACAA GTACGGAGAT TCTTGAAACA 350

GGGATTAAAG TAATCGATTT ATTAGCGCCT TACTTAAAAG GTGGGAAAGT 400

TGGTTTATTC GGTGGTGCCG GTGTAGGTAA AACGGTATTA ATTCAAGAAT 450

TGATTAACAA TATTGCGCAA GAACATGGTG GTATTTCAGT ATTTGCGGGT 500

GTTGGTGAGC GTACTCGTGA AGGAAATGAC CTTTATTATG AAATGAAAGA 550

GTCGGGCGTT ATTGAGAAAA CAGCGATGGT TTTTGGACAA ATGAACGAAC 600

CACCAGGTGC ACGTATGCGA GTTGCTTTAA CTGGTTTAAC CATTGCAGAA 650

TACTTCCGTG ATGTTGAAGG ACAAGATGTA TTACTATTTA TTGATAACAT 700

TTTCCGTTTT ACTCAAGCTG GTTCAGAAGT TTCAGCTTTA TTAGGACGTA 750

TGCCTTCAGC GGTAGGGTAT CAACCGACAT TAGCAACAGA AATGGGACAA 800

TTACAAGAAC GTATTACGTC AACGAAAAAA GGCTCAATTA CATCAA 846



2) INFORMATION FOR SEQ ID NO: 295 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia fergusonii 
(B) STRAIN: ATCC 35469

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTGCAAAATG GTAATGAGCG 50 

TCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGTGGTATC GTGCGTACCA 100

TCGCAATGGG GTCCTCCGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAC 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCGACTC TGGGCCGTAT 200 

CATGAACGTA CTGGGTGAAC CGGTCGACAT GAAAGGCGAG ATCGGTGAAG 250

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGCTGTCA 300

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350

TCCGTTCGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400

GTAAAACTGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500

CGACTTCTAC CACGAAATGA CTGACTCCAA CGTTATCGAC AAAGTATCCC 550

TGGTATATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

TCTGCTGTTC GTTGACAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCAG CGGTAGGTTA TCAGCCGACT 750

CTGGCGGAAG AGATGGGCGT TCTTCAGGAA CGTATCACCT CCACCAAAAC 800

TGG 803



2) INFORMATION FOR SEQ ID NO: 296 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Escherichia hermannii

(B) STRAIN: ATCC 33650 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296 

GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTGCAAA ATGGTGATGA 50

GCGTCTGGTG CTGGAAGTGC AGCAGCAGCT CGGCGGCGGT ATCGTGCGTA 100 

CCATCGCAAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GACTGTCGTC 150 

GACCTCGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCGA CCCTGGGCCG 200 

TATCATGAAC GTGCTGGGTC AGCCGATCGA CATGAAAGGC GATATCGGTG 250 

AAGAAGAGCG TTGGGCGATT CACCGCGCGG CGCCGTCCTA TGAAGAGCTG 300

TCCAGCTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 350

GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGCG 400

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450

CACTCCGGTT ACTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500

TAACGACTTC TACCATGAAA TGACCGACTC CAACGTTCTG GACAAAGTAT 550

CCCTGGTTTA CGGCCAGATG AACGAACCGC CGGGAAACCG TCTGCGCGTT 600 

GCACTGACCG GCCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGTTG TTCGTCGACA ACATCTACCG TTACACCCTG GCCGGTACTG 700 

AAGTATCCGC ACTGCTGGGC CGTATGCCTT CTGCGGTAGG TTACCAGCCG 750 

ACCCTGGCGG AAGAGATGGG CGTTCTGCAG GAGCGTATCA CCTCCACCAA 800



2) INFORMATION FOR SEQ ID NO: 297

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 808 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Escherichia vulneris 

(B) STRAIN: ATCC 33821 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297 



CCGAACGTGT ACGACGCCCT CGAAGTGACA AATGGTAATG AGCGTCTGGT 50 

GCTGGAAGTT CAGCAGCAGC TCGGCGGCGG TATCGTACGT ACCATCGCTA 100 

TGGGTTCTTC CGACGGTCTG CGTCGTGGTC TGGAAGTTCA GGACCTCGAG 150 

CACCCGATCG AAGTGCCGGT AGGTAAAGCG ACCCTGGGTC GTATCATGAA 200 

CGTACTGGGT CAGCCGATCG ATATGAAAGG CGACATCGGT GAAGAAGAGC 250

GTTGGGCTAT TCACCGTGCA GCACCGTCCT ATGAAGAGCT CTCCAGCTCT 300 

CAGGAACTGC TGGAAACCGG CATCAAGGTT ATCGACCTGA TGTGTCCGTT 350 

CGCCAAGGGC GGTAAAGTCG GCCTGTTCGG CGGCGCGGGC GTGGGTAAAA 400 

CCGTAAACAT GATGGAGCTG ATCCGTAACA TCGCGATCGA GCACTCCGGT 450 

TACTCCGTGT TTGCAGGCGT GGGTGAGCGT ACTCGTGAGG GTAACGACTT 500

CTACCACGAG ATGACCGACT CCAACGTTCT GGACAAAGTA TCCCTGGTGT 550 

ACGGCCAGAT GAACGAGCCG CCGGGAAACC GTCTGCGCGT GGCACTGACC 600 

GGCCTGACCA TGGCTGAGAA GTTCCGTGAC GAAGGTCGTG ACGTTCTGCT 650 

GTTCGTTGAC AACATCTATC GTTACACCCT GGCCGGTACG GAAGTATCTG 700 

CACTGCTGGG CCGTATGCCT TCAGCGGTAG GTTACCAGCC GACGCTGGCG 750

GAAGAGATGG GCGTTCTGCA GGAGCGTATC ACCTCCACCA AAACCGGTTC 800

TATCACCT 808






2) INFORMATION FOR SEQ ID NO: 298 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 843 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eubacterium lentum

(B) STRAIN: ATCC 43055 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298

TTTCCCCCTG ATCAGCTGCC GGCGATTTAC AACGCGCTGA CGGTTGATGC 50 

CAAGACCCTG GCGGGCGACT TGCACCTCGT GCTCGAGGTC GAGACGCACC 100 

TGCCGGGCAA CCTTGTCCGC TCGGTGGCCA TGAGCTCGAC GGACGGTCTC 150 

GTCCGCGGCC TCGAGGTCGT CGACACGGGC AACCCGATCA TGATGCCCGT 200

GGGTCCCGAG ACCCTGGGTC GCATCTGGAA CGTCATGGGC GAGCCCGTCG 250

ACGAGAAGCC GATGCCCGAG GTGAAGGGCT ACATGCCCAT CCACCGTCCG 300 

GCTCCGGACT ACGACGAGCT GTCCACCACC ACCGAGATCT TCGAGACCGG 350 

CATCAAGGCC ATCGACCTCG TCGAGCCCTT CGTCAAGGGC GGCAAGACGG 400

GTCTGTTCGG CGGCGCCGGC GTGGGCAAGA CGGTTATCAT CCAGGAGCTC 450

ATCAACAACC TGGCCCAGGA GCACGGCGGC ACGTCGGTGT TCACGGGCGT 500 

GGGCGAGCGT ACCCGCGAGG GTACCGACCT CTACCTGGAG ATGAGCGACT 550 

CGGGCGTCAT CAACAAGACC TGCCTCGTGT ACGGTCAGAT GAACGAGCCT 600 

CCGGGAGCGC GTCTGCGCGT GGGTCTCGCG GGCCTCACCG AGGCGGAGTA 650 

CTTCCGCGAT CAGGGCCAGG ACGTGCTTCT GTTCGTGGAC AACATCTTCC 700

GCTTCACGCA GGCCGGCTCC GAGGTGTCCG CTCTGCTGGG CCGCATGCCC 750 

TCTGCCGTGG GTTACCAGCC GACGCTGGCA ACCGAGATGG GCGACCTGCA 800 

GGAGCGCATC ACGTCGACGT CCACCGGCTC CATCACGTCC GTG 843 






2) INFORMATION FOR SEQ ID NO: 299 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ewingella americana 

(B) STRAIN: ATCC 33852 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299 



TCCCTCAGGA TGCAGTACCG AACGTGTACA ATGCTCTTGA GGTAGAAAAC 50 

GGTGCCTCCA AACTGGTTCT GGAAGTTCAG CAACAGTTAG GCGGCGGCGT 100 

TGTTCGTTGT ATCGCAATGG GTACCTCAGA CGGCCTTCGT CGCGGTCTGA 150

AAGTGAACAA CCTGGAACAC CCAATTGAAG TTCCGGTTGG TAAAGCGACT 200

CTGGGTCGTA TCATGAACGT ATTGGGTGAA CCAATCGACA TGAAAGGTGA 250 

AATCGGCGAA GAAGAACGTC GTGCAATTCA CCGTCCAGCG CCTTCTTATG 300 

AAGAGCTGGC TAACTCCCAA GAATTGCTGG AAACCGGTAT CAAAGTTATG 350 

GACCTGATGT GTCCGTTCGC TAAGGGCGGT AAAGTCGGTC TGTTCGGTGG 400

TGCGGGTGTT GGTAAAACTG TAAACATGAT GGAGCTGATC CGTAACATCG 450

CGATCGAGCA CTCCGGTTAC TCAGTGTTTG CAGGCGTGGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG ACTGACTCCA ACGTTATCGA 550 

CAAAGTTTCC CTGGTCTATG GTCAGATGAA TGAGCCACCA GGTAACCGTC 600 

TGCGCGTTGC ACTGACCGGC CTGACCATGG CGGAGAAATT CCGTGATGAA 650

GGTCGTGACG TACTGCTGTT CGTTGACAAC ATTTACCGTT ACACCCTGGC 700 

AGGTACCGAA GTGTCCGCAC TTCTGGGCCG TATGCCATCG GCGGTAGGTT 750 

ATCAGCCAAC GCTGGCGGAA GAGATGGGTG CTCTGCAAGA GCGTATCACC 800 

TCTACCAAAA GTGGTTCTAT CACCTCCGT 829



2) INFORMATION FOR SEQ ID NO: 300 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(vi) ORIGINAL SOURCE:

(A) ORGANISM: Francisella tularensis 
(B) STRAIN: LVS

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300 

AACACGCCTA AAGTATATGA TGCTTTAAAT GTAGTAGAAG CTGGTTTAGT 50 

ATTAGAAGTT CAGCAACAAA TTGGTGATGG CGTAGTTCGT ACAATTGCTA 100

TGGGATCTAG TGATGGTCTT AGACGTGGTA TGGAAGTTAA GAACACAAAT 150 

GCGCCTATTT CTGTTCCAGT TGGACATGGC ACACTTGGAC GTATCATGAA 200

TGTTTTAGGT GAACCAATTG ATGAAGCTGG TCCAATTGAA TATACTGAGA 250 

AAAGATCTAT CCATCAAGCT CCTCCTGCAT ATGATGAGTT AGCATTAAGT 300 

ACAGAAATAT TAGAAACAGG TATCAAAGTA GTTGACCTTA TTTGTCCATT 350

TGCTAAGGGC GGTAAAGTTG GTTTATTTGG CGGTGCAGGT GTTGGTAAAA 400

CTGTAACGAT GATGGAACTT ATCAACAATA TTGCAAAAGA ACATAGTGGC 450 

TACTCTGTAT TTTCCGGTGT TGGTGAAAGA ACTCGTGAAG GTAATGACTT 500 

CTACTATGAG ATGAAATATT CTAATGTATT GGATAAAGTA TCATTAGTAT 550 

ATGGTCAGAT GAATGAGCCG CCTGGAAACA GATTAAGAGT AGCTCTTAGT 600

GGCTTAACAA TAGCAGAAGG ATTCCGTGAT GAAAAGCGTG ATGTTTTGAT 650 

GTTTATCGAT AACATCTATC GTTATACATT AGCAGGTACA GAGGTATCGG 700 

CGCTACTTGG TCGTATGCCA TCTGCTGTGG GTTATCAGCC AACGCTTGCA 750 

GCTGAGATGG GTGCTTTACA GGAGCGTATT ACATCTACTA AGACAGGATC 800 

TATTA 805



2) INFORMATION FOR SEQ ID NO: 301 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium gonidiaformans

(B) STRAIN: ATCC 25563 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301 

GACGAATTGC CAAAAATATA CAATGCATTA AAGGTGCAAG TTGGAGAAAA 50

AGAACTTGTA TTGGAAGTGC AACAACATTT GGGAAATAAT GTTGTGAGAA 100 

CAGTAGCGAT GGACTCAACA GATGGATTGC TTCGAGGAAT GGAAGTAATG 150 

GATACCGGAG CACCGATTAC TGTTCCAGTA GGGAAGGCGG TTTTAGGAAG 200 

AATATTGAAT GTTTTGGGAG AGCCTGTGGA TCAAAAAGGG CCTGTGGAAA 250 

CAGAAGAATA TTTACCTATC CATAGAGAAG CACCAAAATT TGAAGAACAA 300

GAAACAGTAA CAGAAATTTT TGAAACAGGA ATTAAAGTCA TAGATTTGTT 350 

AGCCCCTTAT ATCAAAGGAG GAAAGACAGG TCTATTCGGT GGAGCCGGAG 400

TAGGGAAAAC AGTTTTAATT ATGGAATTAA TTAATAACAT TGCAAAGGGC 450

CACGGAGGAA TTTCTGTGTT TGCAGGAGTT GGAGAAAGAA CAAGAGAAGG 500 

AAGAGATTTA TACAACGAAA TGACAGAGTC CGGAGTTTTG AATAAGACCT 550

CGTTGGTGTA TGGTCAAATG AATGAGCCGC CCGGAGCAAG ACTTCGTGTG 600 

GCGTTGACAG GATTAACGGT TGCTGAAAAC TTTAGAGATA AAGAAGGGCA 650 

AGATGTATTG TTGTTTATCG ACAATATCTT CCGTTTCACA CAAGCAGGAT 700 

CAGAAGTATC GGCTCTATTG GGAAGAATTC CATCGGCAGT AGGATATCAA 750 

CCGAACTTAG CGACAGAAAT GGGAACTTTA CAAGAAAGAA TTACTTCTAC 800

2) INFORMATION FOR SEQ ID NO: 302

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium necrophorum subsp. necrophorum

(B) STRAIN: ATCC 25286

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302 

ACAATGCATT AAAGGTACAG GTGGGAGAAA GGGAACTTGT GTTGGAAGTG 50 

CAGCAACATT TAGGAAATAA TGTTGTCAGA ACAGTAGCAA TGGATTCAAC 100 

AGACGGATTA CTTCGGGGAA TGGAAGTGAG AGATACAGGA GTTCCCATTA 150 

CTGTTCCGGT AGGAAAGGCG GTTTTGGGAA GAATATTAAA TGTCTTAGGG 200

GAGCCTGTGG ACGAAAAAGG TCCGATAGAG ACAGAAGAAT ATTTACCAAT 250 

ACATAGAGAA GCACCGAAAT TTGAAGAACA GGAAACGGTG ACAGAAATTT 300 

TTGAAACAGG AATTAAAGTC ATTGATTTGT TAGCTCCTTA TATTAAAGGA 350 

GGAAAAACAG GCCTATTCGG AGGAGCCGGA GTAGGAAAAA CCGTTTTGAT 400 

TATGGAACTG ATCAATAATA TTGCAAAAGG TCATGGAGGA ATTTCTGTTT 450 

TTGCAGGAGT TGGAGAAAGA ACGAGAGAGG GAAGAGATCT ATACAACGAA 500 

ATGACAGAGT CCGGAGTTTT GAATAAAACT TCTTTGGTAT ATGGGCAAAT 550 

GAATGAGCCG CCCGGAGCAA GACTTCGAGT GGCTTTAACC GGACTTACTG 600 

TTGCCGAAAA TTTCAGAGAT AAAGAGGGAC AGGATGTCTT ATTGTTCATT 650 

GACAATATTT TCCGTTTCAC ACAAGCAGGT TCGGAAGTAT CGGCACTTTT 700 

GGGGAGAATT CCTTCTGCAG TGGGATATCA ACCGAACTTG GCGACAGAAA 750 

TGGGAAGCTT ACAAGAAAGA ATTACTTCTA CAAAATCCGG TTCTATCACT 800 

TCCGTG 806 



2) INFORMATION FOR SEQ ID NO: 303 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium nucleatum subsp. polymorphum

(B) STRAIN: ATCC 10953 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303 

GATGAATTGC CTGCAATATA TAATGCTTTA AAAGTAAAAT TAGAAGATAA 50 

GGAACTTGTT CTAGAAGTTG AACAACATCT TGGTAACAAT GTTGTAAGAA 100 

CTGTTGCTAT GGATTCAACT GATGGATTAA AAAGAGGAAT GGAAGTTATA 150 

GATACAGGTA AACCAATTAC AGTACCAGTT GGTAAAGCTG TTCTTGGTAG 200

AATATTAAAT GTTTTAGGAG AACCTGTTGA TAATCAAGGT CCTATAAATG 250

CTGAAACATT TTTACCTATT CATAGAGAAG CACCAGAATT TGATGACTTA 300

GAAACTGAAA CTGAAATATT TGAAACAGGA ATAAAAGTTA TAGACTTATT 350

AGCACCATAT ATTAAAGGTG GAAAAATAGG ATTATTTGGT GGAGCTGGAG 400

TAGGAAAAAC AGTTTTAATA ATGGAACTTA TCAACAACAT TGCAAAAGGA 450

CATGGAGGAA TTTCAGTTTT TGCAGGAGTT GGAGAAAGAA CAAGAGAAGG 500

TAGAGACTTA TATGGTGAAA TGACTGAATC AGGAGTTATC ACAAAAACAG 550

CTCTTGTTTA TGGACAAATG AATGAGCCAC CTGGAGCAAG ACTTAGAGTT 600

GCATTAACAG GGCTTACTGT TGCAGAAAAC TTTAGAGATA AAGATGGGCA 650

AGATGTTCTT CTATTTATAG ATAATATATT TAGATTTACA CAAGCAGGTT 700

CAGAAGTTTC AGCTTTACTT GGAAGAATAC CATCAGCTGT TGGATATCAA 750

CCAAACCTAG CAACTGAAAT GGGTGCTTTA CAAGAAAGAA TAACATCTAC 800

AAAATCTGGT TCAATTACAT C 821



2) INFORMATION FOR SEQ ID NO: 304



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 864 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gardnerella vaginalis 

(B) STRAIN: ATCC 49145 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304



TTCCCAGTTG GCTATCTTCC AGATATTTAT AATGCTCTCA AGGTTGATAT 50

CAACACCGTT GGAAACACGG AGGGAGATAC CGTCCACGAG ATTACATTGG 100

AAGTTGAGCA GCACCTTGGT GATTCAACTG TGCGAGCAGT GGCACTTAAG 150

CCTACGGACG GCTTGGTCCG TGGTGCTTTA GTGCGAGATA CTGGTGGCCC 200

AATTTCTGTG CCTGTTGGAG ATGTTACAAA AGGTCACGTT TTTGACGTAA 250

CTGGTAACAT TTTAAACGCT AAACCAGGCG AAAACATTGA GGTGACCGAG 300

CGCTGGCCAA TCCACCGCAA CCCACCTGCT TTCGATCAGC TTGAGTCTAA 350

GACTCAAATG TTTGAAACAG GCATTAAGGT TATCGATTTG CTTACGCCTT 400

ACGTTCAGGG CGGAAAGATT GGTCTGTTCG GTGGTGCAGG CGTTGGTAAA 450

ACTGTGTTGA TTCAGGAGAT GATTCAGCGC GTTGCACAGA ACCACGGCGG 500

TGTGTCTGTG TTTGCTGGCG TTGGCGAACG TACTCGTGAG GGTAACGATT 550

TGATTGGCGA AATGGCTGAG GCTGGCGTTT TGGAGAAAAC AGCGCTTGTC 600

TTTGGTCAGA TGGATGAGCC TCCTGGGACT CGTCTTCGTG TGCCTCTTAC 650

TGCTTTGACT ATGGCTGAGT ATTTCCGTGA TGTTCAGAAT CAGGATGTGT 700

TGCTGTTTAT CGACAACATC TTCCGCTTTA CTCAGGCAGG TTCTGAGGTT 750

TCCACGTTGC TTGGTCGTAT GCCTTCTGCA GTTGGTTATC AGCCAAACTT 800

GGCGGATGAA ATGGGTGCGT TGCAGGAGCG CATTACTTCT ACGCGCGGTC 850

ATTCTATTAC GTCG 864



2) INFORMATION FOR SEQ ID NO: 305

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 848 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella haemolysans 

(B) STRAIN: ATCC 10379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305 

TCGAATCAGG GCATATGCCA AATCTATTAA ACGCTTTAGA AGTTTACATA 50 

GAAAAAGGCG ATGGGAAAAA AGAAAAATTA GTTCTTGAAG TTTCTCTTGA 100 

AATTGGTGAT AACGTAGTAA GAACAATCGC TATGTCATCT ACTGATGGAT 150 

TAAATAGGGG AGCAGAAGTA GTAGATACAG GAGCACCAAT TACAGTTCCT 200 

GTAGGTAACT ACACATTAGG TCGTGTGTTC AACGTATTAG GTGAAGCAGT 250 

TGACCACGGT GAAGAAGCAG GAGCAGAAGT TCGTAAAGAT TCAATTCACA 300 

AAGAAGCTCC AACATTCGAT GAATTATCAA CTCACGTTGA GGTTCTTGAA 350 

ACAGGTATTA AAGTTATCGA CTTACTTGCA CCATATATTA AAGGTGGTAA 400 

AATCGGTCTT TTCGGTGGTG CGGGAGTTGG TAAAACGGTT CTTATCCAAG 450 

AACTTATCAA CAACGTTGCG CAACAACACG GTGGATTATC AGTATTCACA 500 

GGTGTAGGTG AGCGTACTCG TGAAGGAAAT GACTTATACT ATGAAATGAA 550 

AGATTCTGGT GTTATTAACA AAACAGCCAT GGTATTCGGA CAAATGAACG 600 

AACCACCAGG TGCTCGTATG CGTGTAGCAT TAACAGGATT AACAATGGCG 650 

GAATACTTCC GTGATGAAGA AGGACAAGAC GTGCTTCTAT TCATCGATAA 700 

CATTTTCCGT TTCACACAAG CAGGTTCTGA GGTTTCTGCG TTATTAGGAC 750 

GTATGCCATC AGCCGTTGGT TACCAACCAA CACTTGCTAC AGAGATGGGA 800 

CGTTTACAAG AACGTATAAC ATCAACTAAA AAAGGTTCTG TTACATCT 848 



2) INFORMATION FOR SEQ ID NO: 306 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 848 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella morbillorum 

(B) STRAIN: ATCC 27824 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306 

TCGAATCAGG GCATATGCCT AATCTACTAA ACGCTTTAGA AGTTTATATA 50 

GAAAAAGGCG ATGGAAAAAA AGAAAAATTA GTTCTTGAAG TTTCTCTTGA 100 

AATCGGGGAT AATGTCGTAA GAACTATTGC GATGTCATCT ACTGATGGAT 150 

TAAACAGAGG GGCAGAAGTA GTTGATACTG GAGCGCCAAT TACAGTGCCA 200 

GTAGGTAACT ATACATTAGG ACGTGTGTTC AACGTATTAG GTGAAGCAGT 250 

TGACCACGGA GAAGAAGCTG GAGCAGAAGT TCAAAAAGAA TCTATTCATA 300 

AAGAAGCTCC AACTTTCGAA GAATTATCAA CACATGTTGA GGTATTAGAA 350 

ACAGGTATTA AAGTTATCGA CCTTCTTGCA CCATATATTA AAGGTGGTAA 400 

GATTGGACTA TTCGGTGGTG CTGGAGTTGG GAAAACAGTT CTTATCCAAG 450

AACTTATTAA CAACGTAGCA CAACAACACG GAGGACTTTC AGTATTTACT 500 

GGGGTAGGTG AACGTACTCG TGAGGGTAAC GACTTGTACT ATGAAATGAA 550 

AGACTCTGGA GTTATTAATA AAACTGCCAT GGTATTTGGT CAAATGAATG 600 

AGCCACCAGG TGCACGTATG CGTGTTGCCT TAACAGGATT AACAATGGCA 650 

GAGTACTTCC GTGATGAAGA AGGACAAGAC GTACTATTAT TTATCGATAA 700 

TATCTTCCGT TTCACACAAG CAGGGTCTGA GGTATCTGCA TTATTAGGGC 750

GTATGCCTTC AGCCGTTGGA TATCAACCAA CTCTTGCAAC AGAAATGGGA 800

CGTCTTCAAG AACGTATTAC ATCAACTAAA AAAGGATCTG TTACATCT 848



2) INFORMATION FOR SEQ ID NO: 307 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus ducreyi

(B) STRAIN: DSM 8925 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307 

GATGCAGTAC CAAAAGTATA TGATGCTTTA AAAGTTGAAT CAGGTTTAAC 50 

CTTAGAAGTT CAACAACAAT TAGGTGGTGG TTTAGTACGT TGTATCGCAT 100 

TAGGTACCTC AGATGGTTTA AAGCGTAGCT TAAAGGTTGT AAATACAGGT 150 

AACCCTATTC AAGTTCCTGT AGGCACTAAA ACATTAGGCC GTATTATGAA 200 

TGTATTAGGC GAACCAATTG ATGAAAAAGG ACCTATTAGC GAAGAAGCTC 250 

GTTGGGATAT TCATCGTGCG GCTCCAAATT ATGAAGAACA GTCAAATAGT 300 

ACTGAATTAC TTGAAACCGG TATCAAAGTT ATTGACTTAA TTTGTCCATT 350

TGCAAAAGGT GGTAAAGTCG GCTTATTTGG TGGAGCTGGT GTAGGTAAAA 400

CCGTTAATAT GATGGAATTG ATCCGTAATA TTGCTATTGA GCACTCAGGT 450

TATTCGGTTT TTGCTGGTGT AGGTGAGCGT ACTCGTGAAG GTAATGATTT 500 

TTATCATGAA ATGACGGATT CTAATGTATT AGATAAAGTA TCACTAGTAT 550 

ATGGTCAAAT GAATGAACCA CCAGGTAACC GCCTACGTGT TGCGTTAACA 600 

GGTTTAACTA TGGCTGAAAA ATTCCGTGAT GAAGGTCGTG ATGTATTATT 650 

TTTCGTAGAT AATATTTATC GTTATACTTT AGCCGGTACA GAAGTTTCTG 700

CTTTATTAGG CCGTATGCCA TCAGCGGTAG GTTATCAACC AACCCTTGCA 750

GAAGAAATGG GTGTATTACA AGAACGTATT ACCTCAACTA AAACTGGTTC 800 

AATCACGGCA GTA 813 



2) INFORMATION FOR SEQ ID NO: 308 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus haemolyticus 

(B) STRAIN: ATCC 33390 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308 

TGAATTTCCA CAAGATGCAG TGCCAAAAGT TTACGATGCA TTAAAAGTTG 50 
AATCAGGTTT AACACTTGAG GTGCAACAAC AATTAGGTGG CGGTGTGGTA 100 



CGTTGTATCG CATTAGGTGC TTCTGACGGT TTAAAACGTG GTTTAAAAGT 150

AGAAAACACG AATGATCCGA TTCAAGTACC GGTAGGCACA AAAACCCTTG 200

GTCGTATCAT GAATGTATTG GGTGAACCAA TTGACGAACA AGGTCCAATC 250

GGTGAAGAAG AGCGTTGGGC TATCCATCGT TCTGCACCAA GCTATGAAGA 300

ACAATCAAAC AGTACGGAAT TATTAGAGAC TGGTATCAAA GTTATCGACT 350

TAATTTGTCC ATTCGCAAAA GGTGGTAAAG TTGGTCTATT CGGTGGTGCG 400

GGTGTAGGTA AAACCGTTAA CATGATGGAA TTAATCCGTA ACATCGCGAT 450

CGAGCACTCA GGTTACTCCG TATTTGCGGG TGTAGGTGAA CGTACTCGTG 500

AAGGTAACGA CTTCTATCAT GAAATGAAAG ATTCTAACGT ATTAGATAAA 550

GTATCTTTGG TTTATGGTCA GATGAATGAG CCACCAGGTA ACCGTTTACG 600

TGTTGCGTTA ACTGGTTTAA CCATGGCAGA AAAATTCCGC GATGAAGGTC 650

GTGATGTATT ATTCTTCGTG GATAATATCT ATCGTTATAC CCTTGCTGGT 700

ACGGAAGTAT CTGCGTTATT AGGTCGTATG CCATCTGCGG TAGGTTACCA 750

ACCAACTCTT GCTGAAGAAA TGGGTGTGTT ACAAGAACGT ATCACTTCAA 800

CCAAAACAGG TTCTATTACA TCTGTA 826



2) INFORMATION FOR SEQ ID NO: 309 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Haemophilus parahaemolyticus

(B) STRAIN: ATCC 10014 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309 

GATGCAGTAC CAAAAGTATA TGATGCGTTA AAAGTTGAAT CAGGTTTAAC 50

GCTTGAAGTT CAACAACAAT TAGGCGGTGG CTTAGTGCGC TGTATCGCAT 100 

TAGGTACGTC TGATGGTTTA AAACGTGGCT TAAAAGTAGA AAATACAGGC 150 

AACCCAATTG AAGTGCCAGT GGGCACTAAA ACCCTTGGTC GTATTATGAA 200 

CGTATTGGGT GAGCCGATTG ACGAAAAAGG TCCTATCGGT GAAGAAGCAC 250 

GCTGGGCAAT CCACCGTGCA GCACCAAGCT ACGAAGAGCA ATCAAATAGC 300

ACGGAATTAC TCGAAACAGG TATCAAAGTT ATCGACTTAA TCTGCCCATT 350 

CGCAAAAGGG GGTAAAGTTG GTTTATTTGG TGGTGCAGGT GTAGGTAAAA 400 

CCGTAAATAT GATGGAGTTA ATCCGTAACA TCGCGATCGA ACACTCTGGT 450 

TACTCTGTAT TTGCAGGGGT AGGTGAGCGT ACTCGTGAAG GTAATGACTT 500 

CTACCACGAA ATGACAGACT CTAACGTATT AGATAAAGTA TCGTTAGTGT 550

ATGGTCAAAT GAACGAACCA CCAGGTAACC GTTTACGCGT AGCTTTAACA 600 

GGCTTAACCA TGGCGGAAAA ATTCCGCGAT GAAGGTCGTG ACGTATTATT 650 

CTTCGTCGAT AACATCTACC GTTATACCCT AGCAGGTACG GAAGTGTCAG 700 

CACTTCTCGG TCGTATGCCA TCTGCGGTAG GTTATCAGCC AACCTTAGCA 750 

GAAGAAATGG GTGTATTACA AGAGCGTATC ACTTCAACCA AAACTGGTTC 800

TATCACCTC 809 



2) INFORMATION FOR SEQ ID NO: 310

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus parainfluenzae

(B) STRAIN: ATCC 7901 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310 



CGAATTTCCA CAAGATGCAG TACCAAAAGT TTATGATGCA TTAAAAGTTG 50 

AATCGGGTTT AACCCTTGAA GTTCAACAAC AATTAGGTGG TGGTGTGGTA 100 

CGTTGTATCG CACTGGGAGC TTCTGACGGT TTAAAACGCA GTTTAAGCGT 150 

TGAAAATACC AATAAACCAA TTTCAGTACC GGTTGGTGTA AAAACTCTCG 200 

GTCGTATTAT GAACGTATTG GGCGAACCGA TTGATGAAAG AGGTCCTATC 250

GGTGCGGAAG AAGAATGGGC AATTCACCGT TCTACTCCAA GTTATGAAGA 300 

ACAGTCCAAC AGTACCGAAT TATTAGAAAC CGGTATCAAA GTTATCGACT 350 

TAATTTGTCC ATTCGCGAAG GGTGGTAAAG TTGGTTTATT CGGTGGTGCG 400

GGTGTAGGTA AGACCGTAAA TATGATGGAA TTAATCCGTA ATATTGCGAT 450

TGAGCACTCA GGTTACTCCG TATTTGCCGG TGTAGGTGAG CGTACCCGTG 500

AAGGTAACGA CTTCTACCAT GAAATGACAG AATCTAACGT ATTAGACAAA 550 

GTATCCCTAG TTTACGGACA AATGAATGAG CCGCCGGGTA ACCGTTTACG 600 

TGTTGCTTTA ACCGGTTTAA CCATGGCAGA AAAATTCCGT GACGAAGGTC 650 

GTGATGTATT ATTCTTCGTG GATAACATCT ATCGTTATAC CCTTGCAGGG 700 

ACTGAAGTAT CGGCACTTTT AGGCCGTATG CCATCAGCGG TAGGTTATCA 750

GCCGACACTT GCAGAAGAAA TGGGTGTGTT ACAAGAACGT ATTACATCAA 800 

CCAAAACAGG TTCTATTACT TCTG 824 






2) INFORMATION FOR SEQ ID NO: 311 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hafnia alvei 

(B) STRAIN: ATCC 13337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311 



GCCGTGCCTA AAGTGTATAA CGCACTTGAG GTGAAAGGCG GTGCCACTAA 50

ACTGGTACTG GAAGTTCAGC AGCAGCTAGG CGGCGGCGTT GTACGCTGTA 100

TCGCTATGGG TACTTCTGAC GGTCTGCGTC GCGGACTGGA CGTTGTTGAC 150

CTGGAGCACC CGATTGAAGT CCCAGTAGGT AAAGCGACCT TAGGCCGCAT 200

TATGAACGTA CTGGGTGAGC CAATTGATAT GAAGGGTGAT ATCGGCGAAG 250

AAGATCGCTG GGCTATTCAC CGTGAAGCTC CAAGCTACGA AGAACTGTCT 300

AACTCGCAAG AACTGCTGGA AACTGGTATC AAGGTAATGG ACCTGATTTG 350

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTTG 400

GTAAAACAGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450

TCAGGTTACT CTGTATTTGC CGGCGTGGGT GAACGTACTC GTGAGGGTAA 500

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTATTGGAC AAAGTATCAC 550

TGGTTTATGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600

CTGACCGGTC TGACTATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650

ACTGCTGTTC ATCGATAACA TCTACCGTTA TACCTTGGCC GGTACCGAAG 700

TATCTGCACT GTTGGGTCGT ATGCCTTCTG CGGTAGGTTA TCAGCCAACG 750

CTGGCGGAAG AGATGGGTGT TCTGCAAGAA CGTATCACCT CGACCAAAAC 800

GGGTTCAATC A 811



2) INFORMATION FOR SEQ ID NO: 312 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 831 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kingella kingae 

(B) STRAIN: ATCC 23330 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312 



GCGATGCTAT TCCACGCGTT TACGATGCGT TGAAACTGGT TGATGTGGAC 50 

TTGACATTAG AAGTGCAACA ACAACTGGGC GATGGCGTAG TGCGTACCAT 100 

TGCGATGGGT AGTACCGATG GTTTGAAACG CGGCTTAGCC GTGAACAACA 150

CAGGCGCACC TATTACAGTG CCTGTTGGTA AAGCAACATT GGGTCGTATT 200 

ATGGACGTAT TGGGTAATCC TGTTGATGAA GCAGGTCCAA TTGGTTCTGA 250 

CCAAACGCGT GCTATTCACC AACCAGCTCC TAAATTTGAT GAACTGTCTA 300 

GCGCAACCGA ATTGCTGGAA ACAGGCATCA AAGTGATTGA CTTGCTTTGC 350 

CCATTTGCAA AAGGTGGTAA AGTAGGTTTG TTTGGTGGTG CAGGTGTGGG 400

CAAAACTGTG AACATGATGG AGTTGATTAA CAACATTGCC AAAGCGCACA 450 

GTGGTTTGTC TGTATTTGCA GGCGTGGGTG AACGTACTCG CGAAGGTAAT 500

GACTTCTATC ACGAGATGAA AGATTCTAAC GTGTTGGATA AAGTTGCCAT 550 

GGTGTATGGT CAAATGAATG AACCTCCTGG CAACCGTTTG CGCGTTGCAT 600

TGACTGGTTT GTCTATGGCA GAACACTTCC GTGATGAAAA AGACGAAAAT 650

GGCAAAGGTC GCGATGTATT GTTCTTTGTG GACAACATCT ATCGCTACAC 700 

ATTGGCAGGT ACAGAAGTAT CGGCATTGCT GGGTCGTATG CCCTCTGCGG 750 

TAGGTTATCA ACCAACATTG GCAGAAGAAA TGGGTCGTTT GCAAGAGCGT 800 

ATTACTTCAA CGCAAACAGG TTCGATTACT T 831 




2) INFORMATION FOR SEQ ID NO: 313 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 
(B) STRAIN: ATCC 11296



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313 



ATGCCGTACC ACGCGTGTAC GAAGCCCTTG AGGTACAGAA TGGTAATGAA 50 
GTTCTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGCGGTA TCGTACGTAC 100

CATCGCCATG GGTTCTTCTG ATGGTCTGCG CCGCGGTCTG GATGTAAAAG 150

ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC GCTGGGTCGT 200 

ATCATGAACG TACTGGGTCA ACCGGTTGAC ATGAAAGGCG ACATCGGCGA 250 

AGAAGAGCGT TGGGCTATCC ACCGCGCGGC ACCGTCCTAT GAAGAGCTGT 300 

CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT CGACCTGATG 350

TGTCCGTTCG CCAAGGGCGG TAAAGTTGGT CTGTTCGGCG GTGCGGGTGT 400 

AGGTAAAACT GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAGCGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ATAAAGTATC 550 

CCTGGTGTAC GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600

CGCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG CCGGTACTGA 700 

AGTATCCGCG CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACCGGTTCTA TC 812



2) INFORMATION FOR SEQ ID NO: 314 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Klebsiella ornithinolytica

(B) STRAIN: ATCC 31898

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314 

ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTACAGAA TGGTAATGAG 50

AGCCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGTGGTA TCGTACGTGC 100 

TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG GAAGTTAAAG 150 

ACCTTGAGCA CCCGATCGAA GTCCCGGTTG GTAAAGCAAC GCTGGGTCGT 200 

ATCATGAACG TGCTGGGTCA GCCAATCGAT ATGAAAGGCG ACATCGGCGA 250

AGAAGAGCGT TGGGCTATTC ACCGTGCAGC TCCGTCCTAT GAAGAGCTGT 300

CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT CGACCTGATG 350

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400

AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450

ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTCTGG ATAAAGTATC 550

CCTGGTTTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG CCGGTACTGA 700 

AGTATCCGCA CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGT GTTCTGCAGG AACGTATCAC CTCCACCAAA 800

ACCGGTTCTA TC 812 



2) INFORMATION FOR SEQ ID NO: 315

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315 



GTACCGCGCG TGTACGAGGC TCTTGAGGTA CAAAATGGTA GTGAGAATCT 50 

GGTGCTGGAA GTTCAGCAGC AGCTCGGCGG CGGTATTGTT CGTACCATCG 100 

CCATGGGTTC TTCCGACGGT CTGCGTCGCG GTCTGGAAGT CAAAGACCTC 150 

GAGCATCCGA TCGAAGTCCC GGTAGGTAAA GCAACGCTGG GTCGTATCAT 200 

GAACGTACTG GGCCAACCGG TAGACATGAA AGGCGACATC GGCGAAGAAG 250

AGCGTTGGGC GATTCACCGC GCAGCGCCTT CCTACGAAGA GTTGTCAAAC 300

TCTCAGGAAC TGCTGGAAAC CGGCATCAAA GTTATCGACC TGATGTGTCC 350 

GTTTGCGAAG GGCGGTAAAG TTGGTCTGTT CGGTGGTGCG GGTGTAGGTA 400

AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT CGAGCACTCC 450

GGTTACTCCG TGTTTGCGGG CGTAGGTGAA CGTACTCGTG AGGGTAACGA 500

CTTCTACCAC GAAATGACCG ACTCCAACGT TATCGATAAA GTATCCCTGG 550 

TGTATGGCCA GATGAACGAG CCGCCGGGAA ACCGTCTGCG CGTTGCGCTG 600 

ACCGGCCTGA CCATGGCTGA GAAGTTCCGT GACGAAGGTC GTGACGTTCT 650 

GCTGTTCGTC GATAACATCT ATCGTTACAC CCTGGCCGGT ACTGAAGTAT 700 

CCGCACTGCT GGGTCGTATG CCTTCAGCGG TAGGTTACCA GCCGACTCTG 750

GCGGAAGAGA TGGGCGTTCT GCAGGAACGT ATCACCTCCA CCAAAACGGG 800 

TTCTATCACT TCC 813 






2) INFORMATION FOR SEQ ID NO: 316 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 822 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316

GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTACAGA ATGGTAATGA 50 

GAGCCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGTGGT ATCGTACGTG 100 

CTATCGCCAT GGGTTCTTCT GACGGTCTGC GTCGTGGTCT GGAAGTTAAA 150 

GACCTTGAGC ACCCGATCGA AGTCCCGGTT GGTAAAGCAA CGCTGGGTCG 200

TATCATGAAC GTGCTGGGTC AGCCGATCGA TATGAAAGGC GACATCGGCG 250 

AAGAAGAGCG TTGGGCTATT CACCGCGCAG CTCCGTCTTA TGAAGAGCTG 300 

TCCAGTTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 350

GTGTCCGTTC GCTAAGGGCG GTAAAGTAGG TCTGTTCGGT GGTGCGGGCG 400

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450

CACTCCGGTT ACTCCGTGTT TGCGGGCGTC GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTCTG GATAAAGTAT 550 

CCCTGGTTTA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCTCTGACCG GCCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTCGATA ACATCTATCG TTATACCCTG GCCGGTACTG 700

AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTATCAGCCG 750

ACCCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CCTCCACCAA 800 
AACCGGTTCT ATCACTTCCG TA 822 



2) INFORMATION FOR SEQ ID NO: 317 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 785 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317 






AGAATGGTAA TGAAGTTCTG GTGCTGGAAG TTCAGCAGCA GCTGGGCGGC 50 

GGTATCGTAC GTACCATCGC CATGGGTTCT TCTGATGGTC TGCGCCGCGG 100 

TCTGGATGTA AAAGACCTCG AGCACCCGAT CGAAGTCCCG GTAGGTAAAG 150

CAACGCTGGG TCGTATCATG AACGTACTGG GTCAACCGGT TGACATGAAA 200 

GGCGACATCG GCGAAGAAGA GCGTTGGGCT ATCCACCGCG CGGCACCGTC 250 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 300

TTATCGACCT GATGTGTCCG TTCGCCAAGG GCGGTAAAGT TGGTCTGTTC 350 

GGCGGTGCGG GTGTAGGTAA AACTGTAAAC ATGATGGAGC TGATCCGTAA 400

CATCGCGATC GAGCACTCCG GTTACTCTGT GTTTGCGGGC GTAGGTGAGC 450 

GTACTCGTGA GGGTAATGAC TTCTACCACG AAATGACCGA CTCCAACGTT 500 

ATCGATAAAG TATCCCTGGT GTACGGCCAG ATGAACGAGC CGCCGGGAAA 550 

CCGTCTGCGC GTTGCGCTGA CCGGCCTGAC CATGGCTGAG AAATTCCGTG 600 

ACGAAGGTCG TGACGTACTG CTGTTCGTCG ATAACATCTA TCGTTACACC 650

CTGGCCGGTA CTGAAGTATC CGCACTGCTG GGTCGTATGC CTTCAGCGGT 700 

AGGTTATCAG CCGACCCTGG CGGAAGAGAT GGGCGTTCTG CAGGAACGTA 750 

TCACCTCCAC CAAAACCGGT TCTATCACCT CCGTA 785 






2) INFORMATION FOR SEQ ID NO: 318 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 759 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorbata 

(B) STRAIN: ATCC 33433 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318 

CTGGTGCTGG AAGTTCAGCA GCAGCTCGGC GGCGGTATCG TACGTWCCAT 50 

CGCTATGGGT TCTTCCGACG GTCTGCGTCG CGGTCTGGAT GTTAAAGATC 100 

TCGAGCACCC AATCGAAGTT CCGGTMGGTA AAGCAACMCT GGGTCGTATC 150 

ATGAACGTAC TGGGTCAKCC AGTMGACATG AAAGGCGACA TCGGTGAAGA 200

AGAGCGTTGG GCTATCCACC GCGCTGCACC TTCCTACGAA GAGCTGTCTA 250

GCTCTCAGGA ATTGCTGGAA ACCGGTATCA AAGTTATCGA CCTGATGTGT 300 

CCGTTCGCTA AGGGCGGTAA AGTCGGTCTG TTCGGTGGTG CSGGTGTTGG 350 

TAAAACCGTA AACATGATGG AGCTGATCCG TAACATCGCG ATCGAGCACT 400

CCGGTTACTC CGTGTTTGCG GGCGTAGGTG AACGTACTCG TGAGGGTAAC 450

GACTTCTACC ACGAAATGAC CGACTCCAAC GTTATCGATA AAGTATCCCT 500 

GGTATATGGC CAGATGAACG AGCCACCGGG AAACCGTCTG CGCGTTGCTC 550 

TGACCGGTCT GACCATGGCT GAGAAATTCC GTGACGAAGG TCGTGACGTA 600 

CTGCTGTTCG TCGATAACAT CTATCGTTAC ACCCTGGCCG GTACTGAAGT 650

ATCTGCWCTG CTGGGTCGTA TGCCTTCAGC GGTAGGTTAC CAGCCGACCC 700 

TGGCGGAAGA GATGGGCGTT CTGCAGGAAC GTATCACCTC CACCAAGACC 750 

GGTTCTATC 759 






2) INFORMATION FOR SEQ ID NO: 319 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 831 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera cryocrescens 

(B) STRAIN: ATCC 33435 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319 



TTCCCTCAGG ATGCCGTACC GCGTGTGTAC GAAGCCCTTG AGGTTCAGAA 50 

TGGTAATGAA GTGCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTACGTAC CATCGCTATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150

GATGTAAAAG ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

ACTGGGTCGT ATCATGAACG TACTGGGCCA ACCGGTAGAC ATGAAAGGCG 250

ACATCGGTGA AGAAGAACGT TGGGCTATCC ACCGTGCAGC ACCTTCCTAC 300 

GAAGAGCTGT CAAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTTG CGAAGGGCGG TAAAGTTGGT CTGTTCGGTG 400

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT TCGTAACATC 450

GCGATTGAGC ACTCCGGTTA TTCTGTGTTT GCGGGCGThG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTTTC CCTGGTTTAC GGCCAGATGA ACGAGCCACC AGGAAACCGT 600 

CTGCGCGTTG CGCTGACTGG TCTGACTATG GCTGAGAAGT TCCGTGACGA 650

AGGTCGCGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

CCGGTACAGA AGTATCTGCA CTGCTGGGTC GTATGCCTTC AGCGGTAGGT 750 

TACCAGCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 320 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera georgiana 
(B) STRAIN: ATCC 51603

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320 

GCCGTACCGC GCGTGTACGA AGCCCTTGAG GTACAGAATG GTAATGAAGT 50 

GCTGGTGCTG GAAGTTCAGC AGCAGCTCGG TGGCGGTATC GTGCGTACCA 100

TCGCCATGGG TTCCTCCGAC GGTCTGCGTC GCGGTCTGGA AGTTAAAGAT 150 

CTCGAGCACC CGATCGAAGT TCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

CATGAACGTA CTGGGTCACC CGGTAGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCTGCGC CTTCCTACGA AGAGCTGTCC 300 

AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGCGGT GCGGGTGTTG 400

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACTGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAGAC 800 

CGGTTCTATC 810



2) INFORMATION FOR SEQ ID NO: 321 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Lactobacillus acidophilus

(B) STRAIN: ATCC 4356

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321 

TCGATAAGAA TTTACCTGAT ATTAACAACG CCTTACGTGT AATCAAGTCC 50

GAAGATGAAA GCATCGTTCT TGAAGTTACA CTTGAACJCG GTGATGGTGT 100 

TTTAAGAACA ATCGCCATGG AATCTACCGA TGGTCTTCGT CGTGGTATGA 150 

AAGTCGAAGA TACTGGCGCT CCAATTTCAG TTCCAGTTGG AGAAGACACT 200 

TTAGGTCGTG TGTTTAACGT TTTAGGACAG CCTATTGATG GTGGTCCAGC 250

CTTTCCAAAG GATCACCCAC GTGAGGGTAT CCACAAGGAA GCACCTAAAT 300

ATGAAGATTT AACTACTAGT CGTGAAATTC TTGAAACTGG TATCAAGGTT 350 

ATCGACCTTC TTGAACCATA TGTTCGTGGT GGTAAAGTTG GTTTGTTTGG 400 

TGGTGCCGGT GTTGGTAAAA CTACTATTAT TCAAGAATTA ATTCACAACA 450 

TCGCTCAAGA ACACGGTGGT ATTTCCGTAT TTACTGGTGT TGGTGAAAGA 500 

ACTCGTGAAG GTAATGACCT TTACTTTGAA ATGAAAGCTT CAGGCGTTTT 550

AAGTAAGACT GCCATGGTAT TTGGTCAGAT GAACGAGCCG CCTGGTGCCA 600 

GAATGCGTGT TGCATTAACC GGTTTGACAC TTGCTGAATA CTTTAGAGAT 650 

GTTGAAGGTC AAGACGTATT GCTCTTTATT GACAATATCT TTAGATTTAC 700 

TCAGGCTGGT TCAGAGGTAT CTGCTTTGCT TGGTCGTATG CCAAGTGCCG 750 

TAGGTTATCA GCCAACTTTG GCAACAGAAA TGGGTCAATT GCAGGAAAGA 800

2) INFORMATION FOR SEQ ID NO: 322

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella pneumophila subsp. pneumophila

(B) STRAIN: ATCC 33152

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322 

TTCCTCGTGA TAGCGTGCCT AAAGTCAATG ATGCGTTAAA GCTTGTTGAT 50 

AGTGATCTGG TTTTTGAAGT GCAGCAGCAA CTTGGAGACG GAGTTGTGCG 100 

TACTATTGCC ATGGGAACAA CCGATGGTTT AAAGCGAGGA TTAAAAGCAG 150 

AAAATACAGG CCATCCTATT CAAGTGCCAG TAGGTAAGAA AACTTTGGGA 200 

CGCATTATGG ATGTTCTTGG GCGTCCTGTA GATGATGCTG GGCCTATCGA 250 

TGCTGAAGAG ACTTGGGCTA TTCATCGTAA AGCACCAAGT TATGAAGAGC 300 

AAGCTGGCAG CCAGGAATTA TTGGAAACTG GTATTAAAGT AATTGATTTG 350 

CTTTGCCCTT TTGCCAAGGG AGGTAAAGTT GGTCTATTCG GTGGTGCCGG 400

TGTAGGCAAA ACCGTTAACA TGATGGAATT AATACGAAAC ATTGCAATTG 450

AGCATAGCGG TTATTCAGTG TTTGCAGGGG TTGGTGAACG TACCCGTGAA 500 

GGAAACGACT TCTATCATGA GATGAAAGAC TCTAATGTAT TGGATAAAGT 550 

ATCGCTTGTT TATGGTCAGA TGAATGAGCC GCCAGGAAAC CGTTTGCGTG 600 

TTGCTCTAAC CGGTTTGACT ATGGCTGAAA AATTCCGGGA TGAAGGGCGA 650 

GACGTTCTTT TGTTTATCGA TAATATTTAT CGTTATACCT TGGCTGGGGT 700 

TGAAGTATCT GCGCTGTTAG GCCGTATGCC TTCTGCAGTA GGATATCAGC 750 

CGACATTAGC AGAGGAAATG GGTATGCTGC AAGAGCGCAT TACCTCCACA 800 

AAAACAGGTT CTATTACTTC CATA 824 



2) INFORMATION FOR SEQ ID NO: 323 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminorella grimontii 

(B) STRAIN: ATCC 33999 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323 

GACGCCGTAC CGAAAGTGTA CGATGCGCTT GAAGTTCAAA TTGATGCCAA 50 

GCTGGTTCTG GAAGTTCAAC AGCAGCTCGG CGGCGGCGTT GTTCGCTGCA 100 

TCGCGATGGG TACTTCAGAC GGCTTAAGCC GCGGTCTGGA CGTGCTCGAT 150 

CTGGAACACC CGATTGAAGT ACCGGTGGGC AAAGCGACGC TGGGCCGCAT 200

CATGAACGTG CTTGGTCACC CTATCGACAT GAAGGGCGAC ATCGGCGAAG 250

AAGAGCGTTG GGCTATTCAC CGCGCAGCGC CGAGCTACGA AGACCTGTCG 300 

GGCGCAACCG AGCTGCTGGA GACCGGCATC AAGGTTATCG ACCTGATTTG 350 

TCCGTTCGCC AAGGGCGGTA AAGTCGGCCT GTTCGGCGGC GCCGGCGTAG 400 

GTAAAACCGT AAACATGATG GAGCTCATTC GCAACATTGC GACCGAGCAC 450

TCCGGTTACT CCGTGTTTGC AGGCGTAGGT GAACGTACCC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CTGAATCCAA CGTATTGGAC AAGGTGTCGC 550 

TGGTATACGG TCAGATGAAC GAGCCGCCTG GAAACCGTCT GCGCGTAGCG 600 

TTAACGGGCT TGACCATGGC GGAGAAGTTC CGTGATGAAG GCCGTGACGT 650 

TCTGCTGTTT ATCGACAACA TTTACCGCTA TACCCTGGCC GGTACGGAAG 700

TATCCGCACT GCTGGGCCGT ATGCCTTCAG CCGTAGGCTA CCAGCCGACT 750 

CTGGCTGAGG AAATGGGCGT GCTTCAAGAG CGTATTACCT CTACCAAGAC 800 

GGGGTCTATC ACCTCCGT 818 



2) INFORMATION FOR SEQ ID NO: 324 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 835 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Listeria monocytogenes

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324

GTGGAAACTT ACCTGAAATC TACAATGCCC TAGTTATTGA ATATAAATCT 50 

GATGCAGAAG AAGCACCAAC TAGCCAACTT ACTTTAGAAG TAGCCATCCA 100 

ATTAGGTGAT GATGTTGTAC GTACAATCGC AATGGCATCA ACAGATGGTG 150 

TTCAAAGAGG TATGGAAGTT ATTGATACTG GGAGCCCAAT TACAGTTCCT 200

GTAGGTACAG TAACTCTTGG TCGTGTATTT AATGTATTAG GAAACACCAT 250 

CGATTTGGAC GAACCACTTC CAAGCGATAT TAAACGTAAT AAAATTCACC 300 

GCGAAGCTCC AACTTTCGAT CAATTAGCAA CGACAACAGA AATTCTTGAA 350 

ACAGGAATTA AAGTAGTTGA CTTGCTAGCT CCTTACTTAA AAGGTGGTAA 400 

AATCGGATTG TTCGGTGGTG CCGGTGTTGG TAAAACCGTT CTAATTCAAG 450

AGCTTATCCA TAATATCGCA CAAGAACATG GTGGTATTTC TGTGTTCGCT 500 

GGCGTTGGAG AACGTACTCG TGAAGGTAAC GACCTTTACT TCGAAATGAA 550 

AGATTCAGGC GTTATTGAAA AAACAGCGAT GGTATTCGGT CAAATGAACG 600 

AGCCACCAGG TGCGCGTATG CGTGTTGCCT TAACTGGTCT AACAATTGCT 650 

GAATATTTCC GTGATGAAGA ACATCAAGAT GTACTTTTAT TCATTGATAA 700

CATTTTCCGT TTCACACAAG CTGGTTCAGA GGTTTCGGCT TTACTAGGTC 750 

GTATGCCATC TGCGGTAGGT TACCAACCAA CCCTAGCTAC TGAAATGGGT 800 

CAACTACAAG AACGTATTAC ATCTACTAAC GTTGG 835 



2) INFORMATION FOR SEQ ID NO: 325 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 828 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Micrococcus lylae

(B) STRAIN: ATCC 27566

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325

CCCGCGTGGC GAGTTGCCGG CACTGTTCAA CGCGCTGACT GTCGAGGTCA 50 

CCCTCGAAGC AGTCGCTAAA ACCATTACCC TTGAGGTTGC TCAGCACCTC 100 

GGCGACAACT TGGTTCGTGC CGTGTCCATG GCACCGACCG ACGGTCTCGT 150

CCGTGGCGCT GCTGTGATCG ACAGCGGTAA GCCGATCTCA GTTCCCGTTG 200 

GTGACGTAGT CAAGGGACAC GTCTTCAACG CTCTGGGTGA TTGCCTCGAT 250 

GAGCCAGGTC TTGGCCGTGA CGGTGAGCAG TGGGGCATCC ACCGCGATCC 300 

GCCACCTTTT GACCAGCTTG AGGGTAAGAC CGAGATTCTG GAAACCGGTA 350 

TTAAGGTCAT CGACCTGCTG ACCCCGTATG TTAAGGGCGG CAAGATCGGC 400

CTGTTCGGTG GTGCTGGTGT GGGTAAGACC GTTCTTATCC AGGAAATGAT 450 

CACCCGTATC GCTCGCGAGT TCTCCGGTAC CTCGGTGTTC GCAGGCGTGG 500

GTGAGCGTAC CCGTGAGGGC ACCGACCTCT TCCTGGAAAT GGAAGAGATG 550

GGCGTTCTCC AGGACACCGC TCTTGTGTTC GGCCAGATGG ACGAGCCTCC 600 

AGGAGTTCGT ATGCGCGTGG CGCTGTCCGG CCTGACCATG GCGGAGTACT 650

TCCGCGATGT GCAGCACCAG GACGTGCTTC TGTTCATCGA CAACATCTTC 700 

CGTTTCACCC AGGCAGGTTC CGAGGTTTCC ACCCTCCTAG GCCGCATGCC 750 

TTCTGCCGTG GGTTACCAGC CAACGCTGGC AGACGAGATG GGTGTTCTGC 800 

AGGAGCGTAT TACCTCCACA AAGGGTAA 828 

2) INFORMATION FOR SEQ ID NO: 326

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 822 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Moellerella wisconsensis

(B) STRAIN: ATCC 35017

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326 

GATGCCGTAC CAAAAGTGTA CGATGCTCTT GAGGTTCTTA ACGGTAAAGA 50 

AAAATTGGTG CTGGAAGTTC AGCAACAATT AGGCGGTGGT GTTGTTCGTT 100

GTATCGCAAT GGGTACATCA GATGGTTTAA GCCGCGGTTT AGAAGTTAAA 150 

AATACAGATC ATCCGATCGA AGTTCCTGTC GGTGTTAAAA CGCTTGGCCG 200

TATCATGAAC GTGCTGGGTG ACCCAATCGA CATGAAAGGT GATATCGGCG 250 

AAGAAGAACG CTGGTCAATT CACCGCGCAG CACCAAGCTA TGAAGATCTG 300

GCTAACTCAA CAGAACTTCT AGAAACAGGT ATCAAAGTTA TGGACCTGAT 350

TTGCCCATTC GCTAAAGGGG GTAAAGTGGG TCTGTTCGGT GGTGCGGGTG 400 

TCGGTAAAAC AGTTAACATG ATGGAGCTTA TTCGTAATAT CGCGATTGAG 450

CACTCAGGTT ATTCTGTATT CGCGGGTGTT GGTGAACGTA CTCGTGAAGG 500 

TAACGATTTC TACCATGAAA TGACAGACTC AAACGTTCTG GATAAAGTTT 550 

CATTGGTTTA TGGCCAGATG AATGAGCCAC CAGGAAACCG TCTGCGTGTT 600

GCTCTGACTG GTCTGACTAT GGCAGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTACTGTTA TTCGTAGATA ATATTTATCG TTATACCTTA GCAGGGACAG 700 

AAGTATCTGC ACTGCTGGGT CGTATGCCTT CAGCGGTGGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCGACTAA 800 

GACCGGCTCT ATCACTTCCG TA 822

2) INFORMATION FOR SEQ ID NO: 327

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 854 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Branhamella catarrhalis 

(B) STRAIN: ATCC 43628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327 

CCGTGGCGAT GTCCCCCAAA TCTTTGATGC ACTTCATGTT GATGGTACTG 50 

AAACCACCCT TGAAGTCCAA CAACAGTTAG GTGATGGTGT GGTGCGTACC 100 

ATTGCCATGG GTTCTACCGA AGGCTTAAAG CGTGGCTTGC CTGTCTCTAA 150 

TTCAGGTGCA CCCATTTCGG TACCAGTCGG TCAAGCAACA CTGGGTCGCA 200 

TTATGGATGT CCTAGGTCGC CCAATCGATG AAGCAGGTCC GGTAAATGCT 250 

GAACAAAAAT GGTCCATTCA TCGTGAAGCA CCAAGTTATG ATGAACAGTC 300 

AAATAGTACA GAACTTTTAG AAACAGGCAT CAAAGTGATT GATTTGCTTT 350 

GTCCATTTGC CAAAGGTGGT AAAGTCGGTC TGTTCGGTGG TGCTGGTGTT 400 

GGTAAGACCG TTAACATGAT GGAGCTTATC AATAATATCG CCCTAAAACA 450 

CTCAGGTCTG TCGGTTTTTG CTGGTGTGGG TGAGCGTACT CGTGAGGGTA 500 

ATGACTTCTA CCATGAAATG CAAGAAGCAG GCGTTGTTAA TACCGAAGAT 550 

TTTACTCAGT CAAAAGTTGC CATGGTTTAT GGTCAGATGA ATGAGCCACC 600 

AGGAAACCGT CTGCGTGTTG CCTTAACTGG TTTGACCATG GCAGAGTATT 650 

TCCGTGATGA AAAAGACGAA GCAACGGGCA AAGGCCGTGA TGTTCTGCTG 700

TTCGTTGATA ATATTTATCG TTACACATTG GCAGGTACTG AGGTATCAGC 750 

ACTTTTAGGT CGTATGCCAT CTGCGGTAGG TTATCAGCCG ACTTTGGCCG 800 

AAGAGATGGG CTTGCTACAA GAGCGTATCA CCTCCACCCA ATCAGGCTCA 850 

ATTA 854 



2) INFORMATION FOR SEQ ID NO: 328 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moraxella osloensis 

(B) STRAIN: ATCC 19976 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328 

CCGTCAAAGC GTACCAAGAA TTTATGATGC CTTAAAAGTT GAAGGCACAG 50 

AAACTACATT AGAAGTACAA CAACAATTGG GTGATGGTAT CGTACGTACT 100 

ATTGCCATGG GTTCTACTGA AGGTCTAAAA CGTGGTCTAC CAGTTAGCAA 150 

CACTGGCGCA CCAATCTCTG TACCTGTGGG TAAAGGTACA CTAGGTCGTA 200

TCATGGACGT TTTAGGACAC CCAATCGATG AGGCAGGTCC GGTAGAG[illegible] 250

AGTAACACTT GGGCGATTCA CCGTGAAGCG CCAAGCTATG ATGAACAATC 300

AAACTCTACT GAACTTTTAG AAACCGGTAT TAAAGTAATT GACTTACTAT 350

GCCCATTTGC TAAAGGTGGT AAAGTCGGTC TGTTCGGTGG CGCGGGTGTT 400

GGTAAAACCG TTAACATGAT GGAACTTATC AATAACATCG CAAAAGCACA 450

CTCAGGTTTA TCGGTATTTG CTGGTGTAGG TGAGCGTACT CGTGAAGGTA 500

ATGACTTCTA CCACGAGATG AAAGACTCAA ACGTACTTGA TAAAGTTGCG 550

ATGGTGTATG GTCAGATGAA TGAGCCACCA GGAAACCGTT TACGTGTTGC 600

CCTGACAGGT TTAACCATGG CAGAATACTT CCGTGACGAA AAAGATGAAA 650

ACGGTAAAGG TCGTGACGTA TTATTGTTCG TTGACAATAT TTATCGTTAC 700

ACGCTAGCGG GTACCGAAGT ATCAGCATTA TTAGGTCGTA TGCCATCTGC 750

AGTAGGGTAT CAGCCAACGC TTGCAGAAGA GATGGGTGTA CTACAAGAAC 800

GTATTACTTC AACCCAATCA GGCTCTATTA C 831



2) INFORMATION FOR SEQ ID NO: 329 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 835 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Morganella morganii subsp. morganii 

(B) STRAIN: ATCC 25830 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329 



CGAATTTCCT CAGGATGCAG TACCGAAAGT GTACGATGCG CTTGAGGTAA 50 

CAAATGGTAA AGAAAAACTG GTGCTGGAAG TTCAGCAGCA GTTAGGCGGC 100 

GGGGTTGTCC GTTGTATCGC TATGGGTACA TCTGATGGTC TGAGCCGTAA 150

TCTGGAAGTA ACCGATTTAG GCCACCCGAT CGAAGTCCCT GTCGGCGTGA 200 

AAACCTTAGG ACGTATCATG AACGTTCTGG GTGATCCGAT CGATATGAAA 250 

GGTGACATCG GCGCAGAAGA AAAATGGTCT ATTCACCGTG CTGCACCAAC 300 

ATACGAAGAA CTGTCTAACT CCCAGGAACT GCTGGAAACA GGTATCAAAG 350 

TAATGGACCT GATCTGCCCG TTCGCGAAGG GTGGTAAAGT CGGTCTGTTC 400

GGTGGTGCGG GTGTGGGTAA AACCGTAAAC ATGATGGAAC TGATCCGTAA 450 

CATCGCGATC GAGCACTCCG GTTACTCTGT ATTCGCAGGG GTCGGTGAGC 500 

GTACCCGTGA AGGTAACGAC TTCTATCATG AAATGACAGA CTCCAACGTT 550 

CTGGACAAAG TATCACTCGT GTACGGCCAG ATGAACGAGC CACCGGGAAA 600 

CCGTCTGCGC GTTGCTCTGA CCGGTCTGAC CATGGCGGAA AAATTCCGTG 650

ATGAAGGCCG CGATGTACTG CTGTTCGTTG ATAACATCTA CCGTTATACC 700

CTGGCCGGTA CTGAAGTATC CGCGCTGTTA GGCCGTATGC CTTCAGCGGT 750 

AGGTTACCAG CCGACACTGG CGGAAGAAAT GGGTGTGCTT CAGGAACGTA 800 

TCACATCGAC CAAAACAGGC TCTATCACGT CTGTA 835 

2) INFORMATION FOR SEQ ID NO: 330

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 824 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Pantoea agglomerans

(B) STRAIN: ATCC 27155

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330 

GACGCGGTAC CGCAAGTGTA CAGCGCCCTC GAGGTTATGA ATGGTGATGC 50 

GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGC GTAGTACGTA 100

CCATCGCAAT GGGTACGTCT GACGGCCTGA AGCGTGGTCT GAGCGTCAAC 150 

GACCTGCAGA AACCGATTCA GGTACCCGTC GGTAAAGCGA CCCTGGGCCG 200 

TATCATGAAC GTTCTCGGCG AGCCAATCGA TATGAAAGGC GAGCTGAAAG 250 

AAGAAGATGG CAGCGCAGTA GAGATCGCCT CTATTCACCG CGCAGCCCCT 300 

TCTTATGAAG ATCAGTCTAA CTCGCAGGAA CTGCTGGAAA CCGGCATCAA 350

GGTTATCGAC CTGATGTGTC CGTTTGCTAA AGGCGGTAAA GTCGGTCTGT 400

TCGGTGGTGC GGGTGTAGGT AAAACCGTCA ACATGATGGA ACTGATCCGT 450 

AACATCGCGG CTGAACACTC AGGTTACTCA GTGTTTGCCG GTGTGGGTGA 500

GCGTACTCGT GAGGGTAACG ACTTCTACCA CGAAATGACT GACTCTAACG 550 

TTATCGATAA AGTTGCACTG GTCTATGGCC AGATGAACGA GCCGCCGGGT 600

AACCGTCTGC GCGTAGCACT GACCGGTCTG ACCATGGCGG AAAAATTCCG 650 

TGATGAAGGT CGCGACGTTC TGCTGTTCAT CGATAACATC TACCGTTATA 700 

CCCTGGCCGG TACAGAAGTT TCTGCACTGC TGGGTCGTAT GCCATCTGCG 750 

GTAGGTTACC AGCCAACGCT GGCAGAAGAG ATGGGTGTGT TGCAGGAGCG 800 

TATTACCTCC ACCAAGACCG GTTC 824



2) INFORMATION FOR SEQ ID NO: 331

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 808 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Pantoea dispersa

(B) STRAIN: ATCC 14589

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331 

TATACAGCGC TCTCGAGGTA AAAAATGGTG ATGCTCGTCT GGTGCTGGAA 50

GTACAGCAGC AGCTGGGCGG TGGCGTGGTG CGTACCATCG CCATGGGTTC 100 

TTCTGACGGC CTGAAGCGCG GTCTGGAAGT CACCGACCTG AAAAAACCTA 150 

TCCAGGTTCC GGTTGGTAAA GCAACACTCG GCCGTATCAT GAACGTGCTG 200

GGTGAGCCAA TCGACATGAA AGGCGACCTG AAAGAAGAAG ACGGCAGCGC 250 

TGTAGAGGTT TCCTCTATTC ATCGCGCAGC GCCTTCTTAT GAAGATCAGT 300

CAAACTCGCA GGAACTGCTG GAAACCGGCA TCAAGGTTAT CGACCTGATG 350 

TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG GTGCGGGTGT 400

AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGGCTGAGC 450

ACTCAGGTTA TTCGGTCTTT GCCGGCGTGG GTGAGCGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACGGACTCC AACGTTATCG ATAAAGTAGC 550

GCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGTAACCGT CTGCGCGTAG 600 

CACTGACCGG TCTGACCATG GCGGAAAAAT TCCGTGATGA AGGCCGTGAC 650 

GTTCTGCTGT TCATCGACAA CATCTACCGT TACACCCTGG CCGGTACAGA 700 

GGTTTCTGCA CTGCTGGGTC GTATGCCATC GGCGGTAGGT TATCAGCCAA 750 

CGCTGGCTGA AGAGATGGGT GTGCTGCAGG AGCGTATTAC CTCCACCAAG 800

2) INFORMATION FOR SEQ ID NO: 332

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pasteurella multocida

(B) STRAIN: NCTC 10322 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332 



GATGCAGTAC CAAAAGTATA TGATGCCTTA AATGTTGAAA CAGGTTTAGT 50

ACTTGAAGTT CAACAACAAT TAGGTGGTGG TGTAGTTCGC TGTATCGCAA 100

TGGGATCATC TGATGGATTA AAACGCGGTT TAAGCGTAAC AAATACGAAT 150

AACCCAATTT CTGTTCCAGT GGGAACGAAA ACATTGGGTC GTATCATGAA 200

CGTATTGGGT GAACCAATCG ATGAGCAAGG TGAAATCGGT GCAGAAGAGA 250

ATTGGTCTAT TCACCGTGCG CCACCAAGTT ATGAAGAACA ATCTAACAGT 300

ACTGAACTTT TAGAAACGGG AATTAAAGTT ATCGACTTAG TTTGTCCGTT 350

TGCGAAAGGG GGTAAAGTAG GTTTATTCGG TGGTGCGGGT GTCGGTAAAA 400

CCGTCAATAT GATGGAATTA ATCCGTAACA TCGCAATTGA GCACTCAGGT 450

TACTCTGTCT TTGCGGGGGT AGGTGAGCGT ACGCGTGAAG GTAACGACTT 500

CTATCATGAG ATGAAAGACT CTAACGTATT AGATAAAGTG TCTCTTGTTT 550

ATGGTCAAAT GAACGAGCCA CCAGGTAACC GTTTACGTGT GGCATTAACA 600

GGCTTAACTA TGGCGGAAAA ATTCCGTGAT GAAGGTCGTG ATGTCTTATT 650

CTTCGTTGAT AATATTTATC GTTATACTCT TGCTGGTACA GAAGTTTCTG 700

CATTATTAGG TCGTATGCCA TCTGCGGTAG GTTATCAACC AACCCTTGCA 750

GAAGAAATGG GTGTTCTGCA AGAGCGTATT ACCTCAACCA AAACAGGTTC 800

TATTA 805



2) INFORMATION FOR SEQ ID NO: 333 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 828 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pragia fontium 

(B) STRAIN: ATCC 49100

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333

TTTCCTCAAG ACGCCGTACC AAAAGTGTAC GACGCGCTTG AAGTTCAGAA 50 

CGATGCCAAG CTGGTGCTGG AAGTTCAACA ACAGCTCGGT GGTGGTGTCG 100 

TTCGTTGTAT CGCAATGGGT ACTTCCGATG GCTTAAGCCG CGGTTTAAAA 150 

GTGCTTGATT TAGAACATCC TATCGAAGTA CCGGTTGGTA CTGCGACGCT 200

GGGCCGTATT ATGAACGTGC TCGGTCAGCC AATCGATATG AAAGGCGATA 250

TTGGTGAAGA AGAGCGTTGG GCTATTCACC GTGAAGCACC AAGTTATGAA 300 

GATTTATCTG GCGCCAATGA ACTGCTGGAA ACGGGTATCA AGGTTATCGA 350 

CCTGATTTGT CCGTTTGCTA AAGGTGGTAA AGTTGGTCTG TTTGGTGGTG 400

CGGGTGTAGG TAAAACCGTA AACATGATGG AGCTGATTCG TAACATTGCG 450

ACTGAGCACT CAGGTTACTC CGTATTCGCC GGTGTAGGGG AACGTACCCG 500 

TGAAGGTAAT GACTTCTACC ACGAAATGAC CGAATCAAAC GTACTGGATA 550 

AAGTATCTCT GGTTTATGGC CAGATGAACG AGCCACCAGG AAACCGTCTG 600 

CGCGTGGCGT TAACGGGTTT GACCATGGCT GAAAAATTCC GTGATGAAGG 650 

TCGTGACGTT CTGTTATTTA TCGATAACAT TTATCGCTAT ACCTTAGCCG 700

GTACCGAAGT ATCAGCACTG TTGGGGCGTA TGCCATCAGC GGTAGGTTAT 750 

CAGCCAACGT TAGCAGAAGA GATGGGTGTG TTGCAGGAAC GTATTACTTC 800 

AACCAAAACC GGTTCAATCA CTTCTGTA 828 




2) INFORMATION FOR SEQ ID NO: 334 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 807 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334 



GTCCCTAAAG TATACGACGC TCTTGAGGTT ATGAATGGTA AAGAAAAACT 50

GGTGCTGGAA GTTCAGCAAC AGTTAGGCGG TGGTATCGTT CGTTGTATCG 100

CAATGGGTAC ATCAGACGGT TTAAGCCGTG GCTTAAAGGT TGAAGATTTA 150

GGCCACCCAA TTGAAGTACC AGTAGGTAAA GCGACTTTAG GACGTATCAT 200

GAACGTTCTG GGTACACCTA TTGATATGAA AGGTGAGATT GAAACCGAAG 250

AGCGTTGGTC AATCCACCGT GAAGCACCAA CTTACGAAGA GTTATCAAAC 300

TCTCAAGAAC TGCTTGAAAC CGGTATCAAA GTTATGGACT TAATCTGTCC 350

ATTTGCTAAA GGTGGTAAAG TCGGTCTGTT CGGTGGTGCG GGTGTTGGTA 400

AAACAGTTAA CATGATGGAA TTGATCCGTA ATATCGCGAT CGAGCACTCA 450

GGTTACTCTG TATTTGCTGG TGTTGGTGAG CGTACTCGTG AGGGTAACGA 500

CTTCTATCAT GAAATGACAG ATTCTAACGT TCTTGACAAA GTATCGTTAG 550

TTTACGGTCA GATGAATGAG CCACCAGGAA ACCGTCTGCG TGTTGCACTG 600

ACTGGTCTGA CTATGGCTGA GAAATTCCGT GATGAAGGCC GTGACGTACT 650

GTTATTCGTC GATAACATCT ATCGTTACAC CTTAGCCGGT ACAGAAGTAT 700

CAGCACTGTT AGGTCGTATG CCATCAGCGG TAGGTTACCA ACCAACATTG 750

GCTGAAGAGA TGGGTGTTCT GCAAGAGCGT ATCACTTCAA CCAAAACAGG 800

TTCTATC 807



2) INFORMATION FOR SEQ ID NO: 335 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Proteus vulgaris 

(B) STRAIN: ATCC 13315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335 

CCCTAAAGTA TACGACGCTC TTGAGGTTAT GAATGGTAAA GAGAAACTGG 50 

TGCTAGAAGT TCAGCAACAG TTAGGCGGTG GTATCGTTCG TTGTATCGCA 100 

ATGGGTACAT CAGACGGTTT AAGCCGTGGC TTAAAAGTTG AAAACTTAGG 150 

CCACCCAATT GAAGTACCAG TAGGTAAAGC AACACTGGGA CGTATCATGA 200 

ACGTTCTGGG TACACCTATC GATATGAAAG GTGATATTGC AACTGAAGAA 250 

CGTTGGTCTA TTCACCGCGA AGCGCCAACC TATGAAGAGT TATCAAGCTC 300 

TCAAGAACTA CTAGAAACCG GTATCAAAGT AATGGACTTA ATCTGTCCGT 350

TTGCTAAAGG TGGTAAAGTA GGTCTCTTCG GTGGTGCGGG TGTTGGTAAA 400 

ACAGTTAACA TGATGGAATT GATCCGTAAC ATCGCGATTG AGCACTCAGG 450 

TTATTCTGTA TTTGCAGGTG TTGGTGAGCG TACTCGTGAG GGTAACGACT 500 

TCTATCATGA AATGACAGAT TCTAACGTTC TTGACAAAGT ATCGTTAGTT 550 

TATGGTCAGA TGAATGAGCC ACCAGGAAAC CGTCTACGTG TAGCACTGAC 600 

GGGTTTAACC ATGGCGGAAA AATTCCGTGA TGAAGGCCGT GACGTACTGT 650 

TATTCGTCGA TAACATCTAT CGTTACACCT TAGCCGGTAC CGAAGTATCA 700 

GCACTGTTAG GCCGTATGCC ATCAGCAGTA GGTTACCAAC CAACATTGGC 750 

TGAAGAGATG GGTGTTCTGC AAGAACGTAT CACTTCAACC AAAACAGGTT 800 

CAATCACCTC T 811 



2) INFORMATION FOR SEQ ID NO: 336 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia alcalifaciens 

(B) STRAIN: ATCC 9886 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336 

TCAAGATAAC GTACCAAAAG TGTACGATGC TCTTGAGGTT ATTAACGGTA 50 

AAGAAAAACT GGTGTTGGAA GTTCAACAAC AGTTAGGTGG TGGTGTTGTC 100 

CGTTGTATCG CAATGGGTAC ATCAGATGGT CTGAGCCGTG GTTTAGAAGT 150 

TGTAAACTTA GAGCACCCAA TCGAAGTACC AGTCGGTAAA GCAACTCTGG 200 

GACGTATCAT GAACGTTCTG GGTGAACCAA TCGACATGAA AGGTGATATC 250

GGCGAAGAAG AGCGCTGGTC TATTCACCGT GCTGCACCAA GCTACGAAGA 300 

ATTAGCTAAC TCAACTGAAC TGCTGGAAAC CGGTATCAAA GTAATGGACT 350 

TAATCTGTCC ATTCGCGAAA GGTGGTAAAG TAGGTCTGTT CGGTGGTGCG 400

GGTGTTGGTA AAACCGTAAA CATGATGGAA CTGATCCGTA ACATCGCGAT 450

TGAGCACTCA GGTTACTCAG TGTTCGCTGG TGTTGGTGAG CGTACCCGTG 500 

AAGGTAACGA CTTCTATCAT GAAATGACAG ACTCAAACGT TCTGGATAAA 550 

GTATCACTGG TTTATGGCCA GATGAACGAG CCACCAGGAA ACCGTCTGCG 600 

TGTTGCGCTG ACTGGTCTGA CTATGGCTGA AAAATTCCGT GACGAAGGTC 650

GTGACGTACT GCTGTTCGTT GACAACATTT ATCGTTATAC ACTGGCAGGT 700 

ACTGAAGTAT CAGCACTGTT AGGTCGTATG CCATCAGCGG TAGGTTACCA 750 

ACCAACGCTG GCGGAAGAGA TGGGTGTTCT TCAAGAACGT ATTACCTCAA 800

2) INFORMATION FOR SEQ ID NO: 337

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Providencia rettgeri

(B) STRAIN: ATCC 9250

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337



TTCCCTCAAG ATGACGTACC AAAAGTGTAC GACGCTCTTG AGGTTGTTAA 50 

CGGTAAAGAA ACACTGGTGC TGGAAGTTCA GCAACAGTTA GGCGGTGGTG 100 

TTGTCCGTTG TATCGCAATG GGTACATCAG ATGGCCTGAG CCGTGGTTTA 150 

GAAGTTGTAA ACTTAGAGCA CCCAATTGAA GTACCAGTAG GTAAAGCAAC 200 

TTTAGGACGT ATCATGAACG TTCTGGGTCA GCCTATTGAT ATGAAAGGTG 250

ATATCGGCGA AGAAGAGCGC TGGTCAATTC ACCGTGCTGC ACCTAGCTAC 300 

GAAGAGTTAG CTAACTCAAC AGAGCTGCTG GAAACCGGTA TCAAAGTAAT 350 

GGACTTAATC TGTCCATTCG CGAAAGGTGG TAAAGTTGGT CTGTTCGGTG 400 

GTGCGGGTGT TGGTAAAACA GTAAACATGA TGGAACTGAT CCGTAACATC 450 

GCGATTGAGC ACTCAGGTTA CTCAGTATTC GCTGGTGTTG GTGAGCGTAC 500

TCGTGAAGGG AACGACTTCT ATCATGAAAT GACTGACTCA AACGTTCTGG 550 

ATAAAGTATC ACTGGTTTAT GGCCAGATGA ATGAGCCACC AGGAAACCGT 600 

CTGCGCGTTG CGTTGACTGG TCTGACTATG GCTGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTACTACTGT TCGTTGACAA CATCTATCGT TATACACTGG 700 

CAGGTACTGA AGTATCAGCA CTGTTAGGTC GTATGCCTTC AGCGGTAGGT 750

TATCAGCCAA CGCTGGCGGA AGAGATGGGT GTTCTGCAAG AACGTATTAC 800 

CTCAACTCAA ACGGGTTCTA TCACTTCCGT 830 



2) INFORMATION FOR SEQ ID NO: 338

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 812 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rustigianii 

(B) STRAIN: ATCC 33673 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338

AGCGTACCAA AAGTGTACGA TGCTCTTGAG GTTATTAACG GTAAAGAAAA 50 

ACTGGTGTTG GAAGTTCAGC AGCAGTTAGG CGGTGGTGTT GTCCGTTGTA 100 

TCGCAATGGG TACATCAGAT GGTCTGAGCC GTGGTTTAGA AGTTGTAAAC 150 

TTAGAACACC CAATTGAAGT ACCAGTAGGT AAAGCAACTC TGGGACGTAT 200

CATGAACGTT CTGGGTGACC CTATTGATAT GAAAGGTGAT ATCGGCGAAG 250

AAGAGCGCTG GTCTATTCAC CGTTCAGCGC CAAGCTATGA AGAATTAGCT 300

AACTCAACAG AACTGCTAGA AACCGGTATC AAAGTAATGG ACTTAATCTG 350

TCCATTCGCG AAAGGTGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTTG 400

GTAAAACAGT AAACATGATG GAACTGATCC GTAACATCGC GATTGAGCAC 450

TCAGGTTACT CAGTATTCGC TGGTGTTGGT GAGCGTACCC GTGAAGGTAA 500

CGACTTCTAT CATGAAATGA CTGATTCTAA CGTTCTGGAT AAAGTATCAC 550

TGGTTTATGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGTGTTGCG 600

CTGACTGGTC TGACTATGGC TGAAAAATTC CGTGACGAAG GTCGTGACGT 650

ACTGCTGTTC GTTGACAACA TTTATCGTTA TACACTGGCA GGTACTGAAG 700

TATCAGCACT GTTAGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCAACA 750

TTGGCAGAAG AGATGGGTGT TCTACAAGAA CGTATCACTT CTACCAAAAC 800

CGGTTCTATC AC 812



2) INFORMATION FOR SEQ ID NO: 339 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia stuartii 

(B) STRAIN: ATCC 33672 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339 

TCAAGATGCA GTACCAAAAG TGTACGATGC GCTTGAGGTT GTTAACGGTA 50 

AAGAAAAACT GGTGCTGGAA GTTCAGCAAC AGTTAGGCGG TGGTGTTGTC 100 

CGTTGTATCG CAATGGGTAC ATCAGATGGC CTAAGCCGTG GTTTAGAAGT 150 

TAAAAATTTA GAACACCCAA TTGAAGTACC AGTAGGTAAA GCAACACTCG 200 

GACGTATCAT GAACGTTCTG GGTGACCCTA TTGATATGAA AGGTGATATC 250 

GGCGAAGAAG AGCGTTGGTC TATTCACCGC GCTGCACCAA GCTACGAAGA 300 

GCTATCGAGC TCAACTGAAC TGCTAGAGAC AGGTATCAAA GTCATGGACT 350 

TGATCTGTCC ATTCGCGAAA GGTGGTAAAG TTGGTCTGTT CGGTGGTGCG 400

GGTGTTGGTA AAACGGTAAA CATGATGGAA CTTATCCGTA ACATCGCGAT 450

TGAGCACTCA GGTTACTCAG TATTCGCAGG TGTTGGTGAG CGTACCCGTG 500 

AAGGTAACGA CTTCTATCAT GAAATGACAG ATTCAAACGT TCTTGACAAA 550 

GTATCACTGG TTTATGGTCA GATGAATGAG CCACCAGGAA ACCGTCTACG 600 

CGTAGCATTG ACTGGTTTGA CTATGGCTGA GAAATTCCGT GACGAAGGCC 650 

GTGATGTTCT GTTGTTCGTG GATAACATCT ATCGTTATAC ACTGGCAGGT 700 

ACAGAAGTAT CGGCTCTGTT AGGTCGTATG CCATCAGCAG TAGGTTATCA 750 

GCCAACATTG GCAGAAGAGA TGGGTGTTCT TCAAGAACGT ATCACTTCTA 800 

CTAAGACAGG TTCTATCAC 819 



2) INFORMATION FOR SEQ ID NO: 340

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 907 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Psychrobacter phenylpyruvicus 

(B) STRAIN: ATCC 23333 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340 



AACCGCACTG ACGTGCCTCA AATTTATGAC GCGTTAGTTG TAGATGGTAC 50

CGAAACCACC CTAGAAGTTC AGCAGCAGCT GGGCGATGGT GTGGTACGTA 100

CTATTGCAAT GGGATCTACT GAAGGTCTTA AGCGTGGTTT ACCAGTAACA 150

AACACTGGTG CCCCAATTAC AGTTCCAGTA GGTGATGCGA CTTTAGGTCG 200

TATTATGGAC GTTTTAGGTC GTCCAATTGA CGAACAAGGT CCAGTTAATT 250

CTGAAGACCA TTGGTCAATC CACCGTCAAG CGCCATCATA TGATGAGCAA 300

GCTAACAGTA CTGACCTATT AGAGACAGGT ATTAAAGTAA TTGACTTACT 350

TTGTCCGTTC GCTAAAGGGG GTAAAGTTGG TCTGTTCGGT GGTGCCGGTG 400

TTGGTAAAAC CGTAAACATG ATGGAATTGA TTAATAACAT CGCTCTTAAG 450

CACTCAGGTT TATCAGTATT CGCTGGTGTG GGTGAGCGTA CTCGTGAAGG 500

TAACGACTTC TACCACGAGA TGCAAGAAGC GGGTGTTGTT GACGTTGAAA 550

ACTTCACCAA CTCAAAAGTT GCGATGGTTT ATGGTCAGAT GAATGAGCCA 600

CCAGGTAACC GTTTACGTGT TGCGTTAACC GGTCTGACTA TGGCTGAGTA 650

CTTCCGTGAT CAAAAAGATG AAAACGGTAA AGGTAAAGAC GTTCTATTAT 700

TCGTTGATAA CATCTACCGC TACACGCTAG CCGGTACTGA AGTATCAGCA 750

CTTCTAGGTC GTATGCCATC AGCAGTAGGT TATCAGCCAA CACTAGCGGA 800

AGAGATGGGT GTACTACAAG AGCGTATTAC TTCAACTCAG ACTGGTTCTA 850

TTACTTC 907



2) INFORMATION FOR SEQ ID NO: 341

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 832 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Rahnella aquatilis

(B) STRAIN: DSM 4594

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341

GTTCCCTCAG GATGCAGTAC CGAACGTGTA CAATGCTCTT GAGGTAGAAA 50

ACGGTACCTC CAAACTGGTG CTGGAAGTTC AGCAACAGTT AGGCGGCGGC 100

GTTGTTCGTT GTATCGCAAT GGGTACCTCA GACGGCCTGC GTCGCGGTCT 150

GAAAGTGAAC AACCTGGAAC ACCCAATTGA AGTACCGGTA GGTAAAGCGA 200

CTCTGGGTCG TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAAGGT 250

GAAATCGGCG AAGAAGAACG TCGTGCTATT CACCGTGCTG CGCCTTCTTA 300

TGAAGAGCTG GCAAACTCCC AGGAATTGCT GGAAACCGGT ATCAAAGTTA 350

TGGACCTGAT GTGTCCGTTC GCTAAGGGCG GTAAAGTTGG TCTGTTCGGT 400

GGTGCGGGTG TAGGTAAAAC TGTGAACATG ATGGAGCTGA TCCGTAACAT 450

TGCGATCGAG CACTCCGGTT ATTCTGTGTT TGCAGGCGTG GGTGAACGTA 500

CTCGTGAGGG TAACGACTTC TACCACGAAA TGACTGATTC CAACGTTATC 550

GACAAAGTTT CCCTGGTGTA TGGCCAGATG AATGAGCCAC CAGGTAACCG 600

TCTGCGCGTT GCACTGACCG GCCTGACCAT GGCGGAAAAA TTCCGTGATG 650

AAGGTCGTGA CGTACTGCTG TTCGTTGACA ACATTTACCG TTACACCCTG 700

GCCGGTACCG AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG 750

TTATCAGCCA ACGCTGGCGG AAGAGATGGG CGCTCTGCAA GAACGTATCA 800

CCTCGACCAA AAGTGGTTCT ATCACCTCCG TA 832



2) INFORMATION FOR SEQ ID NO: 342 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 824 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Salmonella choleraesuis subsp. arizonae

(B) STRAIN: ATCC 13314

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTT 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCCAC 200

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCTATTC ACCGCGCGGC GCCGTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TAAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCT AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCACC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGACAA CATCTACCGT TATACCCTCG 700

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TAYCASCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCAC 824 

2) INFORMATION FOR SEQ ID NO: 343

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 820 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis serotype Choleraesuis

(B) STRAIN: ATCC 7001

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343

GATGCCGTAC CACGCGTGTA CGATGCCCTT GAGGTGCAGA ATGGTAATGA 50

GAAGCTGGTG CTGGAAGTTC AGCAGCAGCT TGGCGGCGGT ATCGTGCGTA 100

CCATCGCGAT GGGGTCTTCT GACGGTCTGC GTCGCGGTCT GGATGTAAA[illegible] 150

GATCTCGAAC ACCCGATCGA AGTCCCGGTA GGTAAAGCCA CGCTGGGTCG 200 

TATCATGAAC GTCCTGGGCG AACCGGTCGA CATGAAAGGC GAGATCGGCG 250 

AAGAAGAGCG TTGGGCGATT CACCGCGCAG CACCTTCCTA CGAAGAGTTG 300 

TCAAACTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 350

GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTTA TTCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCAGTGTT TGCGGGCGTA GGGGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GATAAAGTAT 550 

CCCTGGTGTA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600

GCACTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGTCGTGA 650 

TGTACTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGC CGTATGCCTT CCGCAGTAGG TTACCAGCCG 750 

ACTCTGGCGG AAGAGATGGG CGTTCTGCAG GAACGTATCA CCTCCACCAA 800 

AACCGGTTCT ATCACCTCCG 820



2) INFORMATION FOR SEQ ID NO: 344 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Salmonella choleraesuis subsp. diarizonae

(B) STRAIN: ATCC 43973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344 



TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTT 150

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200

GCTGGGTCGT ATCATGAACG TCCTGGGTGA ACCGGTCGAT ATGAAAGGCG 250

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCGTCCTAC 300

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCT AACGTTATCG 550

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCACC GGGAAACCGT 600

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650

AGGTCGTGAC GTTCTGCTGT TCGTCGACAA CATCTACCGT TATACCCTCG 700

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750

TATCAGCCGA CTCTGGCTGA AGAAATGGGC GTTCTGCAGG AACGTATCAC 800

CTCCACCAAA ACCGGTTCTA TCACCTCCGT A 831



2) INFORMATION FOR SEQ ID NO: 345

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 831 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. houtenae 

(B) STRAIN: ATCC 43974 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTACGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 200

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCTGC GCCGTCCTAC 300

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450

GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGTCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TATCAGCCGA CGCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAG ACCGGTTCTA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 346 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. indica 

(B) STRAIN: ATCC 43974 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTCGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCCAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCGTCCTAT 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ACAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAGAAGT TCCGTGACGA 650 

AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 700

CCGGTACGGA AGTTTCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750

TATCAGCCAA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 
CTCCACCAAA ACCGGTTCTA TCACCTCCG 829 



2) INFORMATION FOR SEQ ID NO: 347 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 817 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis
serotype Paratyphi A
(B) STRAIN: ATCC 9150

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347 

GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCTACGC TGGGTCGTAT 200

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250

AAGAGCGTTG GGCGATTCAC CGCGCAGCGC CTTCCTACGA AGAGTTGTCA 300

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400

GTAAAACCGT AAACATGATG GAGCTTATCC GTAACATCGC GATCGAGCAC 450

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CATGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGTCGT ATGCCTTCCG CGGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAGAC 800

CGGTTCTATC ACCTCCG 817






2) INFORMATION FOR SEQ ID NO: 348 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis
serotype Paratyphi B
(B) STRAIN: ATCC 8759



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348

GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATT GTGCGTACCA 100

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 200

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250

AAGAGCGTTG GGCGATTCAC CGCGCAGCGC CTTCCTACGA AGAGTTGTCA 300

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTGG 400

GTAAAACCGT AAACATGATG GAGCTTATCC GTAACATCGC GATCGAGCAC 450

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600

TTGACCGGTC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700

TATCCGCACT GCTGGGCCGT ATGCCTTCCG CAGTAGGTTA CCAGCCGACT 750

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800

CGGTTC 806



2) INFORMATION FOR SEQ ID NO: 349 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. salamae 

(B) STRAIN: ATCC 43972 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGKGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

GCTGGGCCGT ATCATGAACG TACTGGGCGA ACCGGTCGAC ATGAAAGGCG 250

AGATTGGTGA AGAAGAGCGT TGGGCTATTC ACCGTGCTGC GCCGTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTCAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TATCAGCCGA CGCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACTAAA ACCGGTTCTA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 350 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 823 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis
serotype Typhi
(B) STRAIN: ATCC 10749

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAC ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC GCCTTCCTAC 300 

GAAGAGTTAT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCATGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAGAAAT TCCGTGACGA 650 

AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 
TACCAGCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800

CTCCACCAAG ACCGGTTCTA TCA 823






2) INFORMATION FOR SEQ ID NO: 351 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis
serotype Typhimurium
(B) STRAIN: ATCC 14028



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351 






TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100

TTGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 200

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAC ATGAAAGGCG 250

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC GCCTTCCTAC 300

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450

GCGATCGAGC ACTCCGGTTA CTCAGTGTTT GCGGGCGTAG GGGAACGTAC 500

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600

CTGCGCGTTG CATTGACCGG TCTGACCATG GCGGAGAAAT TCCGTGACGA 650

AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700

CCGGTACGGA AGTATCCGCA CTGCTGGGCC GTATGCCTTC CGCAGTAGGT 750

TACCAGCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800

CTCCACCAAA ACCGGTTCTA TCACCTCCGT A 831



2) INFORMATION FOR SEQ ID NO: 352 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Virchow 

(B) STRAIN: ATCC 51955 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352 

GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 200 

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGTTGTCA 300 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGG GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTCTCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

TTGACCGGTC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCCG CAGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGGTTCTATC 810 



2) INFORMATION FOR SEQ ID NO: 353 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia ficaria 

(B) STRAIN: ATCC 33105

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353

GATGCCGTAC CAAAAGTGTA CGATGCCCTT GAGGTAGAAA ACGGTACCGA 50 

GAAACTGGTG CTGGAAGTTC AGCAGCAGCT GGGCGGTGGC GTCGTTCGCT 100

GTATCGCAAT GGGGACCTCT GACGGTCTGC GTCGCGGTCT GAAAGTGAAC 150 

AACCTGGAAC ACCCGATTGA AGTGCCGGTG GGTAAAGCTA CCCTGGGCCG 200 

TATCATGAAC GTATTGGGCG AACCAATCGA CATGAAAGGC GAGATCGGCG 250 

AAGAAGAGCG TTGGGCGATT CACCGTCCTG CGCCAAGCTA CGAAGAGCTG 300 

TCCAACTCCC AGGACCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 350

TTGTCCGTTC GCCAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGCG 400

TGGGCAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ATTCCGTGTT TGCGGGCGTG GGCGAGCGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGAACGACTC CAACGTTCTG GACAAAGTAT 550 

CCCTGGTTTA CGGCCAGATG AACGAGCCGC CGGGTAACCG TCTGCGCGTT 600

GCATTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGCCGCGA 650 

CGTTCTGCTG TTCGTTGACA ACATTTACCG TTACACCCTG GCCGGTACCG 700 

AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CCGCGGTAGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG CGTTCTGCAA GAACGTATCA CCTCGACCAA 800 

GACCGGTTCC ATCACCTCCG 820






2) INFORMATION FOR SEQ ID NO: 354 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Serratia fonticola

(B) STRAIN: ATCC 29844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354 

GATGCCGTAC CGAAAGTGTA CAACGCTCTT GAGGTTGCAA ACGGCACCGA 50

GAAATTGGTG CTGGAAGTTC AGCAACAGCT GGGTGGCGGC GTGGTTCGCT 100 

GTATCGCAAT GGGGACCTCA GACGGTCTGC GTCGTGGTCT GGCCGTAACC 150 

GACCTGCAGC ACCCAATTGA AGTACCGGTA GGTAAAGCTA CCCTGGGCCG 200 

TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAGGGC GACATCGGCG 250 

AAGAAGAACG TTGGGCTATT CACCGCCCTG CGCCAAGCTA CGAAGAGCTG 300

TCCAGCTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 350 

CTGCCCGTTC GCCAAGGGTG GTAAAGTTGG TCTGTTCGGT GGTGCTGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ATTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGATTC CAACGTACTG GACAAAGTTT 550

CCCTGGTTTA CGGCCAGATG AACGAGCCAC CAGGTAACCG TCTGCGCGTT 600 

GCGCTGACCG GCCTGACCAT GGCTGAGAAG TTCCGTGACG AAGGTCGTGA 650 

CGTACTGCTG TTCGTCGATA ACATCTACCG TTATACCCTG GCCGGTACCG 700 

AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCTACCAA 800

GACTGGTTCA ATCACC 816 



2) INFORMATION FOR SEQ ID NO: 355

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Serratia grimesii

(B) STRAIN: ATCC 14460 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355 



GATGCCGTAC CAAAAGTGTA CAACGCTCTT GAGGTAGAAA ACGGTGCCAA 50 

TAAGCTGGTG CTGGAAGTTC AGCAACAGTT GGGCGGTGGC GTGGTTCGCT 100 

GTATCGCAAT GGGGACCTCT GATGGTCTGC GTCGCGGTCT GAAAGTCACA 150 

GACCTGGACC ACCCAATTGA AGTACCGGTA GGTAAAGCTA CTCTGGGCCG 200 

TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAGGGC GATATCGGCG 250

AAGAAGAACG TTGGGCGATT CACCGTCCGG CGCCAAGCTA CGAAGATTTG 300 

GCCAACTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 350 

CTGCCCGTTC GCCAAGGGTG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400

TTGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ATTCTGTGTT TGCGGGCGTG GGTGAGCGTA CTCGTGAGGG 500

TAACGACTTC TACCACGAAA TGAACGACTC CAACGTACTG GACAAAGTAT 550 

CCCTGGTTTA CGGCCAGATG AACGAGCCAC CGGGTAACCG TCTGCGCGTT 600 

GCTCTGACCG GTCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGCCGTGA 650 

CGTTCTGCTG TTCGTTGATA ACATCTACCG TTATACCCTG GCCGGTACCG 700

AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 750

ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCTACCAA 800 

GACTGGTTCA ATCACCTCCG TA 822 






2) INFORMATION FOR SEQ ID NO: 356 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 819 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia liquefaciens 

(B) STRAIN: ATCC 27592 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356

ATGCCGTACC AAAAGTGTAC AATGCTCTTG AGGTAGAAAA CGGTACCGAG 50 

AAGCTGGTGC TGGAAGTTCA GCAGCAGTTG GGCGGTGGCG TAGTTCGCTG 100 

TATCGCGATG GGGACCTCAG ATGGTCTGCG CCGCGGTCTG AAAGTGAACG 150 

ATCTGGAACA CCCAATTGAA GTACCGGTAG GTAAAGCTAC CCTGGGCCGT 200

ATCATGAACG TATTGGGTGA ACCAATCGAC ATGAAAGGCG ACATCGGCGA 250 

AGAAGAACGT TGGGCGATTC ACCGTCCAGC GCCAAGCTAC GAAGATTTGT 300

CAAACTCCCA GGATCTGCTG GAAACCGGTA TCAAGGTTAT GGACCTGATT 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCTGGTGT 400

TGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450

GTGAGCGTAC TCGTGAGGGT 500

AACGTACTGG ACAAAGTATC 550 

GGGTAACCGT CTGCGCGTTG 600 

TCCGTGACGA AGGCCGCGAC 650 

TATACCCTGG CCGGTACCGA 700 

TGCGGTAGGT TATCAGCCAA 750 

AACGTATCAC CTCTACCAAG 800 

819 






2) INFORMATION FOR SEQ ID NO: 357



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 805 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear



(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 

(B) STRAIN: ATCC 13880 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357 



TCAGGATGCC GTACCGAAAG TGTACGACGC CCTTGAGGTA GAAAACGGCA 50 

CCGAAAAACT GGTGTTGGAA GTTCAGCAAC AGCTGGGCGG TGGCGTGGTT 100 

CGCTGTATCG CAATGGGGAC CTCCGACGGT CTGCGTCGCG GTCTGAAAGT 150

GAACAACCTG GACCACCCGA TTGAAGTGCC GGTGGGTAAA GCTACCCTGG 200 

GTCGTATCAT GAACGTATTG GGTCAACCGA TCGACATGAA AGGCGACATC 250 

GGCGAAGAAG AGCGTTGGGC GATTCACCGC GCGGCGCCAA GCTACGAAGA 300 

GCTGTCAAGC TCTCAGGAAC TGCTGGAAAC CGGTATCAAG GTAATGGACC 350 

TGATTTGTCC GTTCGCCAAG GGCGGTAAAG TCGGTCTGTT CGGCGGTGCG 400

GGCGTAGGTA AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT 450 

CGAGCACTCC GGTTATTCCG TGTTTGCGGG CGTGGGCGAG CGTACTCGTG 500 

AGGGTAACGA CTTCTACCAC GAAATGACCG ACTCCAACGT TCTGGACAAA 550 

GTATCCCTGG TTTACGGCCA GATGAACGAG CCACCAGGTA ACCGTCTGCG 600 

CGTTGCGCTG ACCGGTCTGA CCATGGCGGA GAAATTCCGT GACGAAGGCC 650

GTGACGTTCT GCTGTTCGTT GACAACATCT ACCGTTACAC CCTGGCCGGT 700 

ACCGAAGTGT CCGCACTTCT GGGCCGTATG CCATCCGCGG TAGGTTATCA 750 

GCCAACGCTG GCGGAAGAGA TGGGCGTTCT GCAAGAACGT ATCACCTCGA 800 

CCAAG 805 




2) INFORMATION FOR SEQ ID NO: 358 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia odorifera

(B) STRAIN: ATCC 33077

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358



GATGCCGTAC CAAAAGTGTA CGATGCCCTT GAGGTAGAAA ACGGTACCGA 50

AACTCTGGTG CTGGAAGTTC AGCAGCAGCT GGGCGGTGGC GTGGTTCGTT 100

GTATCGCTAT GGGCACCTCC GACGGTTTGC GTCGCGGCCT GAAAGTGAAC 150

GATCTGCAAC ACCCAATCGA AGTCCCGGTT GGCAAGGCAA CGCTGGGTCG 200

TATCATGAAC GTATTGGGTC AACCAATCGA CATGAAAGGC GACATCGGCG 250

AAGAAGAGCG TTGGGCGATT CACCGCGCGG CGCCAAGCTA CGAAGAACTG 300

TCCAACTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTTA TGGACCTGAT 350

CTGCCCGTTT GCCAAGGGTG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400

TTGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450

CACTCCGGTT ATTCAGTGTT TGCGGGCGTG GGTGAGCGTA CTCGTGAGGG 500

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTACTG GACAAGGTTT 550

CCCTGGTTTA CGGCCAGATG AACGAGCCAC CGGGTAACCG TCTGCGCGTT 600

GCGCTGACCG GTCTGACCAT GGCCGAGAAA TTCCGTGACG AAGGTCGTGA 650

CGTTCTGCTG TTCGTTGACA ACATTTACCG TTACACCCTG GCCGGTACCG 700

AAGTGTCTGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 750

ACGCTGGCGG AAGAGATGGG CGTTCTGCAA GAACGTATCA CCTCGACCAA 800

GACCGGTTCT ATCACCTCCG TA 822



2) INFORMATION FOR SEQ ID NO: 359 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Serratia plymuthica 

(B) STRAIN: ATCC 183 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359 

GTGTACAACG CTCTTGAGGT AGAAAACGGT GCCAATAAGC TGGTGCTGGA 50 

AGTTCAGCAA CAGCTGGGCG GTGGCGTGGT TCGCTGTATC GCGATGGGGA 100 

CCTCTGATGG TCTGCGTCGC GGTCTGAAAG TGATCGACCT GGATCACCCG 150 

ATTGAAGTAC CGGTAGGTAA AGCTACCCTG GGCCGTATCA TGAACGTATT 200 

GGGTGAACCA ATCGACATGA AAGGCGACAT CGGCGAAGAA GAACGTTGGG 250 

CAATTCACCG TCCAGCGCCA AGCTACGAAG ATTTGGCCAA CTCCCAGGAT 300 

CTGCTGGAAA CCGGTATCAA GGTTATGGAC CTGATCTGTC CGTTCGCTAA 350 

GGGCGGTAAA GTGGGTCTGT TCGGCGGTGC GGGCGTGGGT AAAACCGTAA 400 

ACATGATGGA GCTGATCCGT AACATCGCGA TCGAACACTC CGGTTATTCC 450 

GTGTTTGCGG GCGTGGGTGA GCGTACTCGT GAGGGTAACG ACTTCTACCA 500 

CGAAATGAAC GACTCCAACG TACTGGACAA AGTATCCCTG GTTTACGGCC 550 

AGATGAACGA GCCACCGGGT AACCGTCTGC GCGTTGCTCT GACCGGTCTG 600 

ACCATGGCGG AGAAATTCCG TGACGAAGGC CGCGACGTTC TGCTGTTCGT 650 

TGATAACATC TACCGTTATA CCCTGGCCGG TACCGAAGTG TCCGCACTTC 700 

TGGGCCGTAT GCCATCTGCG GTAGGTTATC AGCCAACGCT GGCGGAAGAG 750 

ATGGGCGTTC TGCAAGAACG TATCACCTCT ACCAAGACCG GTTCTATCAC 800 

CTCCG 805 



2) INFORMATION FOR SEQ ID NO: 360

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia rubidaea 

(B) STRAIN: ATCC 27593 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360 



TTCCCTCAGG ATGCCGTACC AAAAGTGTAC GATGCCCTTG AGGTAGAGAA 50

CGGTAACGAA AAACTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGCGGCG 100

TTGTACGCTG TATCGCCATG GGTACTTCCG ACGGTCTGCG TCGCGGTCTG 150

AAAGTTAACG ACCTCGAGCA CCCAATCGAA GTGCCAGTTG GTAAAGCAAC 200

GCTGGGTCGT ATCATGAACG TATTGGGTCA GCCAATCGAC ATGAAAGGCG 250

ACATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCAAGCTAC 300

GAAGAGCTGT CCAGCTCCCA AGAGCTGCTG GAAACCGGTA TCAAGGTAAT 350

GGACCTGATC TGCCCGTTCG CCAAGGGTGG TAAAGTTGGT CTGTTCGGTG 400

GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC 450

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTGG GTGAGCGTAC 500

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTACTGG 550

ACAAAGTATC CCTGGTTTAC GGCCAGATGA ACGAGCCGCC GGGTAACCGT 600

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650

AGGCCGCGAC GTTCTGCTGT TCGTGGATAA CATCTACCGT TACACCCTGG 700

CCGGTACCGA AGTGTCCGCA CTGCTCGGCC GTATGCCATC TGCGGTAGGT 750

TATCAGCCAA CGCTGGCGGA AGAGATGGGC GTTCTGCAAG AACGTATCAC 800

CTCGACCAAG ACCGGTTCAA TCACCTCCGT A 831






2) INFORMATION FOR SEQ ID NO: 361



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 406 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas putida 

(B) STRAIN: LCDC D7172 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361

CCGTAAACAT GATGGAACTG ATCCGTAACA TCGCCATCGA GCACAGCGGT 50 

TATTCCGTGT TCGCCGGTGT GGGTGAGCGT ACTCGTGAGG GTAACGACTT 100 

CTACCACGAG ATGAAGGACT CCAACGTTCT CGACAAGGTA GCGCTGGTCT 150 

ACGGTCAGAT GAACGAGCCA CCAGGAAACC GTCTGCGCGT AGCGCTGACC 200

GGCCTGACCA TGGCCGAGAA GTTCCGTGAC GAAGGTAACG ACGTTCTGCT 250 

GTTCGTCGAC AACATCTATC GTTACACCCT GGCCGGTACC GAAGTATCCG 300

CACTGCTGGG CCGTATGCCT TCGGCGGTAG GTTACCAGCC GACCCTGGCT 350 

GAAGAGATGG GCGTTCTGCA AGAACGTATC ACTTCGACCA AGGAAGGTTC 400

GATCAC 406



2) INFORMATION FOR SEQ ID NO: 362

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella boydii 

(B) STRAIN: ATCC 9207 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362 

TTCCCTCAGG ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA 50 

TGGTAATGAG CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTGCGTAC CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC 200 

TCTGGGCCGT ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG 250 

AGATCGGTGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC 300 

GAAGAGCTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG 400

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ACAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT 750 

TATCAGCCGA CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACTGGTTCTA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 363 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella dysenteriae 

(B) STRAIN: ATCC 11835 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363 

GCCGTACCGC GCGTGTACGA TGCTCTTGCG GTGCAAAATG GTAATGAGCG 50 

TCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCAATGGG TTCCTCCGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAC 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCGACTC TGGGCCGTAT 200 

CATGAACGTA CTGGGTGAAC CGGTCGACAT GAAAGGCGAG ATCGGTGAAG 250

AAGAGCGTTG GGCTATTCAC CGCGCAGCAC CTTCCTACGA AGAGCTGTCA 300

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400

GTAAAACCGT AAACATGATG GAGCTCATTC GTAACATCGC GATCGAGCAC 450

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAC AAAGTATCCC 550

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCT 600

CTGACCGGTC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650

TCTGCTGTTC GTTGACAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700

TATCCGCACT GCTGGGCCGT ATGCCTTCAG CGGTAGGTTA TCAGCCGACC 750

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800

CG 802






2) INFORMATION FOR SEQ ID NO: 364 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella flexneri 

(B) STRAIN: ATCC 12022 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364 

ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 200 

ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 250 

AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 300 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG CCGGTACGGA 700 

AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACTGGTTCTA TCACCTCCG 819 



2) INFORMATION FOR SEQ ID NO: 365 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Shigella sonnei 

(B) STRAIN: ATCC 29930

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365 

GTACCGCGCG TGTACGATGC TCTTGAGGTG CAAAATGGTA ATGAGCGTCT 50 

GGTGCTGGAA GTTCAGCAGC AGCTCGGCGG CGGTATCGTG CGTACCATCG 100 

CAATGGGTTC CTCCGACGGT CTGCGTCGCG GTCTGGATGT AAAAGACCTC 150

GAACACCCGA TCGAAGTCCC GGTAGGTAAA GCGACTCTGG GCCGTATCAT 200

GAACGTACTG GGTGAACCGG TCGACATGAA AGGCGAGATC GGTGAAGAAG 250 

AGCGTTGGGC GATTCACCGC GCAGCACCTT CCTACGAAGA GCTGTCAAAC 300 

TCTCAGGAAC TGCTGGAAAC CGGTATCAAA GTTATCGACC TGATGTGTCC 350 

GTTCGCTAAG GGCGGTAAAG TTGGTCTGTT CGGTGGTGCG GGTGTAGGTA 400

AAACCGTAAA CATGATGGAG CTCATTCGTA ACATCGCGAT CGAGCACTCC 450 

GGTTACTCTG TGTTTGCGGG CGTAGGTGAA CGTACTCGTG AGGGTAACGA 500 

CTTCTACCAC GAAATGACCG ACTCCAACGT TATCGACAAA GTATCCCTGG 550 

TGTATGGCCA GATGAACGAG CCGCCGGGAA ACCGTCTGCG CGTTGCTCTG 600 

ACCGGTCTGA CCATGGCTGA GAAATTCCGT GACGAAGGTC GTGACGTTCT 650

GCTGTTCGTT GACAACATCT ATCGTTACAC CCTGGCCGGT ACGGAAGTAT 700 

CCGCACTGCT GGGCCGTATG CCTTCAGCGG TAGGTTATCA GCCGACCCTG 750 

GCGGAAGAGA TGGGCGTTCT GCAGGAACGT ATCACCTCCA CCAAAACTGG 800 

TT 802 




2) INFORMATION FOR SEQ ID NO: 366 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 785 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366 



TAACGCCTTG GTTATTGATG TGCCTAAAGA AGAAGGTACA ATACAACTAA 50 

CATTAGAAGT TGCGCTGCAA TTAGGTGACG ACGTTGTTCG TACAATTGCG 100 

ATGGATTCAA CTGATGGTGT CCAAAGAGGC ATGGATGTAA AAGATACAGG 150

CAAAGAAATT AGTGTACCTG TTGGTGATGA AACATTAGGT CGTGTATTTA 200 

ATGTACTAGG TGAAACAATT GACCTTAAAG AAGAAATTAG TGATTCTGTT 250 

CGCCGCGATC CTATCCATCG TCAAGCACCA GCATTCGATG AACTTTCAAC 300 

AGAAGTTCAA ATTTTAGAAA CAGGTATTAA AGTAGTAGAT TTACTAGCAC 350 

CTTATATTAA AGGTGGTAAA ATCGGATTGT TCGGTGGTGC CGGTGTAGGT 400

AAAACAGTAT TAATCCAAGA ATTAATTAAC AACATCGCTC AAGAGCACGG 450 

TGGTATTTCT GTATTCGCCG GTGTAGGTGA ACGTACTCGT GAAGGTAACG 500 

ATTTATACTT CGAAATGAGT GATAGTGGTG TAATTAAGAA AACAGCCATG 550 

GTATTCGGGC AAATGAATGA GCCACCTGGT GCACGTATGC GTGTTGCATT 600 

ATCTGGTTTA ACAATGGCTG AATATTTCCG TGACGAACAA GGTCAAGACG 650

TATTATTATT CATCGATAAC ATTTTCAGAT TTACACAAGC TGGTTCTGAG 700 

GTATCTGCAT TATTAGGTCG TATGCCTTCT GCAGTAGGTT ACCAACCAAC 750 

ACTTGCTACT GAAATGGGAC AATTACAAGA ACGTA 785



2) INFORMATION FOR SEQ ID NO: 367

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 






(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus auricularis 

(B) STRAIN: ATCC 33753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367 






GAACATAACG AAGTGCCTAA TATTAATAAC GCCTTAGTAC TCGATGTTGA 50

AAGAGAAGAC GGAACAGTGT CTTTAACTTT AGAAGTAGCT TTACAATTAG 100

GCGATGACGT TGTTCGTACC ATTGCAATGG ATTCAACTGA TGGTGTTAAA 150

CGTGGTAACG AAGTCAAAGA TACTGGTAAT AGCATTAGCG TACCAGTCGG 200

AGACGAAACT TTAGGACGTG TCTTCAACGT TCTAGGTGAA ACAATTGATT 250

TAGAAGATAA ACTTGATGAT TCTGCGCGAC GTGACCCTAT ACATAGAGAA 300

GCGCCAGCGT TTGATCAATT ATCAACTCAA GTTGAAATTT TAGAAACAGG 350

AATTAAAGTT GTTGACTTAT TAGCACCTTA TATTAAAGGT GGTAAAGTTG 400

GACTCTTCGG TGGTGCCGGT GTTGGTAAAA CCGTTTTAAT CCAAGAATTA 450

ATCAACAACA TCGCTCAAGA ACACGGTGGT ATTTCAGTCT TTGCCGGTGT 500

AGGTGAACGT ACACGTGAAG GTAACGACTT GTACTATGAA ATGAGCGACA 550

GTGGTGTAAT CAAGAAAACA GCCATGGTCT TCGGACAAAT GAACGAACCA 600

CCTGGCGCAC GTATGCGTGT TGCTTTATCT GGTTTAACAA TGGCTGAATA 650

TTTCCGTGAT GAACAAGGAC AAGACGTATT GTTATTCATC GACAATATTT 700

TCCGTTTCAC ACAAGCCGGT TCAGAAGTTT CTGCCTTACT AGGTCGTTTA 750

CCATCAGCCG TTGGTTATCA ACCTACATTA GCAACAGAAA TGGGACAATT 800

ACAAGAACGT ATTACTTCAA CAACAAAAGG ATCAGTTACT TCA 843



2) INFORMATION FOR SEQ ID NO: 368 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 849 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus capitis subsp. capitis

(B) STRAIN: ATCC 27840



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368 



GCTTTGAACA TAATGAAGTT CCTGATATTA ACAATGCCTT 
55 GTTCCTAAAG AAGATAGCAC ACTTCATTTA ACTTTAGAAG 
ATTAGGTGAC GATGTAGTAC GTACAATCGC AATGGACTCA 
TTCAAAGAGG TATGGAAGTT AAAGATACAG GTAAAGATAT 
GTTGGTGATG CAACTTTAGG AAGAGTATTT AACGTATTAG 
CGATTTAGAT GAAAAGATTG ATGATTCAGT ACGTCGTGAT 
60 GACAGGCACC TGGCTTCGAT GAATTATCTA CTAAAGTAGA 

220 



ACACATCGAA 50 

TTGCACTTCA 100 

ACTGACGGCG 150 

TAGCGTACCT 200 

GAGAAACAAT 250 

CCTATTCATA 3 00 

AATCTTAGAA 350 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



ACAGGTATCA AAGTAGTAGA 
AATTGGATTA TTCGGTGGTG 
AACTTATCAA TAATATCGCT 
GGTGTTGGTG AACGTACACG 
CGATAGTGGT GTAATTAAGA 
AGCCACCTGG TGCTCGTATG 
GAATATTTCC GTGATGAAGA 
TATCTTCAGA TTCACACAAG 
GTATGCCTTC AGCCGTTGGT 
CAATTACAAG AACGTATTAG 



CTTATTAGCA 
CCGGTGTTGG 
CAAGAGCATG 
TGAAGGTAAC 
AAACAGCGAT 
CGTGTTGCAT 
AGGCCAAGAC 
CTGGTTCTGA 
TATCAACCAA 
TTCAACTAAT 



CCTTACATTA AAGGTGGTAA 
TAAGACAGTT TTAATCCAAG 
GTGGTATTTC AGTATTCGCC 
GACCTTTACT ATGAAATGAG 
GGTATTCGGT CAGATGAACG 
TATCAGGTTT AACAATGGCA 
GTATTATTAT TCATTGATAA 
AGTTTCAGCA TTACTTGGAC 
CACTTGCTAC TGAAATGGGT 
AAAGGTTCTG TTACTTCAA 



4 00" 

450 

500 

550 

600 

650 

700 

750 

800 

849 
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20 
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2) INFORMATION FOR SEQ ID NO:- 369 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 

(B) STRAIN: DSM 20260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369 

GAAGTTCCAG AAATTAATAA TGCCTTAGTT CTCGATATAG AAAGAGAAGA 50 

AGGTACTGTT GAATTAACGT TAGAAGTTGC ATTACAACTT GGTGATGACG 100 

TAGTACGAAC AATCGCAATG GATTCAACTG ATGGTGTTAA ACGTGGTACA 150 

GAAGTTAGAG ATAGCGGAAA TAGTATCAGC GTACCAGTTG GTAATGAAAC 200

ATTAGGTAGA GTATTTAATG TATTAGGTGA TACGATAGAT TTAGATGAAG 250

ACATAGATGA CTCAGTGCGT CGTGACCCAA TTCATAGAGA AGCACCTGCA 300

TTTGATCAGT TATCTACTAA AGTTGAAATT TTAGAAACAG GTATCAAAGT 350

CATTGATTTA TTAGCACCAT ATATCAAAGG TGGTAAAGTT GGATTATTCG 400

GTGGTGCCGG TGTTGGTAAA ACTGTATTAA TTCAAGAATT AATCAATAAT 450

ATCGCTCAAG AGCATGGTGG TATATCCGTA TTTGCTGGTG TAGGTGAGCG 500 

TACGCGTGAA GGTAATGACC TATACTTTGA AATGAGTGAT AGTGGTGTTA 550 

TTAAAAAGAC AGCTATGGTA TTTGGACAAA TGAACGAACC ACCTGGTGCG 600 

CGTATGCGAG TAGCACTTTC TGGTTTAACA ATGGCTGAAT ATTTCCGGGA 650 

TGAACAAGGA CAAGATGTTC TATTATTCAT AGATAACATC TTTAGATTTA 700 

CTCAAGCTGG TTCAGAAGTT TCTGCGTTAT TAGGTCGTAT GCCTTCAGCT 750

GTTGGTTACC AACCAACGTT AGCAACTGAA ATGGGACAAT TACAAGAACG 800 

TATTACTTCT ACAACTAAAG GTTCAGTAAC 830 



2) INFORMATION FOR SEQ ID NO: 370 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Staphylococcus epidermidis

(B) STRAIN: ATCC 14990 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370 



AATAATGCAT TACACATCGA AGTTCCTAAA GAAGATGGAG CGCTTCAATT 50

AACATTAGAA GTTGCACTTC AACTAGGTGA CGATGTAGTT CGTACAATTG 100

CAATGGACTC AACTGACGGC GTTCAAAGAG GAATGGAAGT TAAAGATACA 150

GGTAGAGACA TAAGTGTACC TGTCGGTGAC GTAACTCTAG GAAGAGTGTT 200

TAACGTACTA GGAGAAACTA TTGACTTAGA TGAAAAAATT GATGATTCAG 250

TACGACGTGA CCCTATCCAT AGACAAGCTC CAGGATTCGA CGAATTATCA 300

ACAAAAGTAG AAATCTTAGA AACTGGTATT AAAGTAGTAG ACTTATTAGC 350

ACCTTACATA AAAGGTGGTA AAATTGGATT ATTTGGTGGT GCCGGTGTAG 400

GTAAAACCGT ACTAATCCAA GAACTTATTA ATAACATCGC TCAAGAACAC 450

GGTGGTATCT CAGTATTCGC TGGTGTTGGT GAACGTACAC GTGAAGGTAA 500

TGATCTTTAC TATGAAATGA GTGACAGTGG TGTTATCAAG AAAACTGCAA 550

TGGTCTTTGG TCAAATGAAT GAGCCACCTG GTGCACGTAT GCGTGTAGCA 600

TTATCCGGAT TAACAATGGC CGAATATTTC CGAGATGAAG AAGGCCAAGA 650

TGTGTTATTA TTCATTGATA ACATTTTCAG ATTCACTCAA GCTGGTTCAG 700

AAGTTTCTGC GTTATTAGGT CGTATGCCAT CAGCTGTTGG TTATCAACCT 750

ACACTTGCTA CAGAAATGGG TCAATTACAA GAACGTA 787



2) INFORMATION FOR SEQ ID NO: 371 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371 

GAAGTACCTG AAATTAATAA CGCCTTAATC ATCGAAGTTC CCAAAGAAGA 50 

TGGTACTTTT GAATTAACGC TTGAAGTTGC ATTACAACTA GGTGATGACG 100 

TTGTTCGTAC AATTGCTATG GATTCAACAG ATGGTGTTCA ACGTGGTATG 150 

GAAGTTCAGA ACACTGGAAA AGACATTTCA GTACCAGTTG GCGAAGTAAC 200

TTTAGGACGT GTATTTAACG TATTAGGTGA CACAATTGAT TTAGAAGATA 250

AATTAGATGG TTCAGTAAGA CGTGATCCAA TTCATAGACA ATCACCTAAC 300

TTTGACGAAT TATCTACTGA AGTAGAAATT CTTGAAACTG GAATCAAAGT 350

TGTAGACTTA TTAGCACCAT ACATCAAAGG TGGTAAAATC GGTCTATTTG 400

GTGGTGCCGG TGTTGGTAAA ACCGTTTTAA TCCAAGAATT GATTAATAAT 450

ATCGCACAAG AACATGGTGG TATCTCAGTA TTTGCTGGTG TAGGTGAACG 500 

TACACGTGAA GGTAACGACC TATATTATGA AATGAGAGAT AGTGGTGTTA 550 

TTAAGAAAAC AGCAATGGTA TTTGGTCAAA TGAACGAGCC ACCTGGTGCA 600 

CGTATGCGTG TGGCACTTTC TGCATTGACA ATGGCTGAGT ATTTCCGTGA 650 

TGAACAAGGA CAAGACGTTC TGTTATTCAT CGATAACATT TTCAGATTTA 700 

CTCAAGCAGG TTCAGAAGTA TCAGCATTAT TGGGACGTAT GCCTTCAGCT 750 

GTAGGTTATC AACCTACTTT AGCTACAGAA ATGGGTCAAT TACAAGAACG 800 

TATTACATCA ACGAATAAAG GTTCAGTAAC 830 



2) INFORMATION FOR SEQ ID NO: 372

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 846 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis subsp. hominis 

(B) STRAIN: ATCC 27844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372



TCGAACATAA TGAAGTCCCT GAAATTAATA ATGCCCTAAT TATTGAAGTA 50 

CCCAAAAATG ATGGCACATT TAAATTAACA TTAGAAGTTG CATTGCAACT 100 

AGGTGATGAT GTTGTTCGTA CTATTGCAAT GGATTCAACT GATGGTGTTC 150 

AACGTGGTAT GCAAGTTGTG AATACTGGTA AAGATATTAG TGTTCCTGTA 200

GGTGAAGAAA CACTTGGACG TGTGTTTAAC GTTTTAGGAG AAACAATAGA 250 

TTTAAACGAA AAAATAGATA GTTCTGTTAG ACGTGATCCA ATTCATCGTC 300 

GTCAACCTAA TTTTGATGAA TTATCTACTG AAGTAGAAAT TCTTGAAACA 350 

GGTATTAAAG TTGTAGACTT ATTAGCACCT TATATTAAAG GTGGTAAGAT 400 

TGGTTTATTC GGTGGTGCCG GCGTAGGTAA AACTGTATTA ATTCAAGAAT 450

TAATCAATAA TATCGCTCAA GAACATGGTG GTATTTCTGT ATTCGCTGGT 500 

GTAGGTGAAC GTACTCGTGA AGGTAACGAT TTATACTATG AAATGAGCGA 550 

TAGTGGCGTT ATCAATAAAA CAGCCATGGT ATTTGGGCAA ATGAATGAGC 600 

CGCCAGGTGC GCGTATGCGT GTTGCTTTAT CAGCATTGAC AATGGCTGAA 650 

TATTTCCGTG ATGAACAAGG TCAAGATGTA CTTTTATTCA TTGACAATAT 700

TTTCCGCTTT ACTCAAGCTG GTTCTGAAGT TTCAGCATTA TTAGGACGTA 750 

TGCCTTCAGC TGTAGGTTAT CAACCTACAT TAGCAACTGA AATGGGTCAA 800 

TTACAAGAAC GTATTACATC TACTAATAAA GGTTCAGTCA CTTCAA 846 






2) INFORMATION FOR SEQ ID NO: 373 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 846 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 175 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373 



TCGAACATAA TGAAGTCCCT GAAATTAATA ATGCCCTAAT TATTGAAGTA 50

CCCAAAAATG ATGGCACATT CAAATTAACA TTAGAAGTTG CATTGCAACT 100

AGGTGATGAT GTTGTTCGTA CTATTGCAAT GGATTCAACT GATGGTGTTC 150

AACGTGGTAT GCAAGTTGTG AATACTGGTA AAGATATTAG TGTTCCTGTA 200

GGTGAAGAAA CACTTGGACG TGTGTTTAAC GTTTTAGGAG AAACAATAGA 250

TTTAAACGAA AAAATAGATA GTTCTGTTAG ACGTGATCCA ATTCATCGTC 300

GTCAACCTAA TTTTGATGAA TTATCTACTG AAGTAGAAAT TCTTGAAACA 350

GGTATTAAAG TTGTAGACTT ATTAGCACCT TATATTAAAG GTGGTAAGAT 400

TGGTTTATTC GGTGGTGCCG GCGTAGGTAA AACTGTATTA ATTCAAGAAT 450

TAATCAATAA TATCGCTCAA GAACATGGTG GTATTTCTGT ATTCGCTGGT 500

GTAGGTGAAC GTACTCGTGA AGGTAACGAT TTATACTATG AAATGAGCGA 550

TAGTGGCGTT ATCAATAAAA CAGCCATGGT ATTTGGGCAA ATGAATGAGC 600

CGCCAGGTGC GCGTATGCGT GTTGCTTTAT CAGCATTGAC AATGGCTGAA 650

TATTTCCGTG ATGAACAAGG TCAAGATGTA CTTTTATTCA TTGACAATAT 700

TTTCCGCTTT ACTCAAGCTG GTTCTGAAGT TTCAGCATTA TTAGGACGTA 750

TGCCTTCAGC TGTAGGTTAT CAACCTACAT TAGCAACTGA AATGGGTCAA 800

TTACAAGAAC GTATTACATC TACTAATAAA GGTTCAGTCA CTTCAA 846



2) INFORMATION FOR SEQ ID NO: 374 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus lugdunensis 
(B) STRAIN: ATCC 43809

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374 

ATAATGAAGT GCCTGAAATA AATAATGCGC TCATTGTTGA AATTCCTAAA 50 

AGTGATACAA CAATCAGTTT AACACTTGAA GTTGCTTTGC AATTAGGTGA 100

CGATGTTGTA CGTACTATTG CAATGGATTC AACTGATGGC GTTCAACGTG 150

GTATGGAAGT TCAAAACACA GGTAAAGACA TCAGTGTACC TGTTGGAGAT 200

GAAACATTAG GAAGAGTATT TAACGTTTTA GGAGAATCTA TTGATTTAGA 250 

AGAAAAGCTA GATGACTCTG TGCGTAGAGA TCCAATTCAT AGACTAGCAC 300 

CTAAATTTGA TGAATTATCT ACAGAAGTAG AAATTCTTGA AACTGGTATT 350

AAAGTTGTTG ATTTATTAGC ACCATATATT AAAGGTGGTA AAGTTGGATT 400

GTTTGGTGGT GCCGGAGTAG GTAAAACGGT ATTAATTCAA GAATTAATCA 450

ACAATATTGC TCAAGAACAT GGTGGTATTT CTGTGTTTGC CGGAGTAGGT 500

GAACGTACAC GTGAAGGTAA TGACTTATAT TATGAAATGA GCGATAGTGG 550

CGTAATTAAG AAAACAGCGA TGGTATTTGG CCAAATGAAT GAACCACCTG 600

GTGCACGTAT GAGAGTTGCG TTATCTGCCT TAACAATGGC TGAATATTTC 650 

CGTGACGAGC AAGGACAAGA CGTATTGCTG TTTATCGATA ATATATTCCG 700 

TTTTACACAA GCAGGTTCAG AAGTATCTGC ATTACTTGGA CGTATGCCAT 750 

CTGCCGTTGG TTATCAACCA ACATTGGCTA CAGAAATGGG ACAATTGCAA 800 

GAAAGAATTA CATCTACAAA TAAAGGTTCT GTAAC 835



2) INFORMATION FOR SEQ ID NO: 375 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 842 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Staphylococcus saprophyticus

(B) STRAIN: ATCC 15305

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375



GAGCACAATG AAGTTCCAGA AATTAACAAT GCCTTAGTCG TAGACGTTGA 50

AAGAGATGAA GGTACAGTAT CTCTTACATT AGAAGTGGCA TTACAACTTG 100

GCGATGATGT CGTACGTACA ATTGCAATGG ATTCTACTGA TGGTGTTAAA 150

CGTGGTACAG AAGTTCGAGA TAGCGGAGAT AGCATCAGTG TTCCAGTTGG 200

TGATGCTACG TTAGGACGTG TGTTTAATGT TCTTGGTGAT ACAATTGACT 250

TAGACGAGAA GCTTGATACT TCTGTCAAAC GTGATCCAAT TCATAGAGAA 300

GCACCTGCAT TCGATCAATT ATCAACAAAA GTTGAAATCT TAGAAACAGG 350

TATTAAAGTA ATTGATTTAC TTGCACCATA TATTAAAGGT GGTAAAATCG 400

GTTTATTCGG TGGCGCTGGT GTAGGTAAAA CAGTATTAAT TCAAGAATTA 450

ATTAATAATA TAGCTCAAGA ACATGGTGGT ATTTCAGTAT TTGCCGGCGT 500

AGGTGAACGT ACGCGTGAAG GTAATGACTT ATACTACGAA ATGAGTGATA 550

GTGGTGTTAT TAAGAAAACA GCTATGGTCT TCGGACAAAT GAATGAGCCA 600

CCTGGTGCGC GTATGCGTGT TGCTTTATCA GGCTTAACAA TGGCTGAACA 650

CTTCCGTGAT GTACAAGGAC AAGATGTTTT ACTATTTATT GATAACATAT 700

TCAGATTTAC GCAAGCTGGT TCAGAAGTAT CAGCACTATT AGGTCGTATG 750

CCATCAGCCG TTGGTTATCA ACCTACCCTT GCTACTGAAA TGGGTCAATT 800

ACAAGAACGT ATTACATCAA CAACTAAAGG ATCTGTAACG TC 842



2) INFORMATION FOR SEQ ID NO: 376 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 842 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus simulans 

(B) STRAIN: ATCC 27848 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376 

TGATGAACTG CCTAAGATTA ATAACGCATT AGTGCTAGAT GTACCTAAGA 50 

AAGATGGCAC GACTGAATCT CTTACATTAG AAGTAGCACT TGAATTAGGC 100 

GACGACGTAG TTAGAACTAT CGCCATGGAC TCTACAGACG GAATTAAACG 150 

TGGTGACGAC GTTAAAGACA CTGGTCGTCC AATCAGTGTA CCTGTCGGTG 200 

AAGATACGTT AGGAAGAGTA TTTAACGTTT TAGGTGATCC AATCGATAAT 250 

GATGGACCGA TTTCTGAATC AGTTCCACGT GAACCAATTC ATAGACAACC 300

ACCTAAATTT GATGAATTAT CAACAAAAGT TGAACTACTT GAAACTGGTA 350

TCAAAGTAGT AGACTTATTA GCACCATATA TCAAAGGTGG TAAAGTTGGT 400

TTATTCGGTG GTGCCGGAGT AGGTAAAACT GTATTAATCC AAGAATTAAT 450 

TAATAACATC GCTCAAGAAC ACGGCGGTAT TTCAGTATTC GCAGGTGTTG 500 

GTGAACGTAC ACGTGAAGGT AACGACTTGT ACTTCGAAAT GAGCGACAGT 550 

GGTGTTATCA AGAAAACAGC GATGGTATTC GGACAAATGA ACGAACCACC 600 

TGGTGCACGT ATGCGTGTAG CTTTATCAGG TTTAACAATG GCTGAATACT 650 

TCCGTGATGT TAAAGGACAA GACGTTCTTT TATTCATCGA TAACATTTTC 700 

CGCTTCACAC AAGCAGGTTC TGAGGTATCA GCATTGCTTG GCCGTATGCC 750 

ATCAGCCGTT GGTTACCAAC CAACATTGGC AACAGAAATG GGTCAATTAC 800 

AAGAACGTAT CACTTCTACA ATGAAAGGTT CTATCACATC TA 842 



2) INFORMATION FOR SEQ ID NO: 377

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 841 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 27836 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377

CATAACGAAG TCCCTGATAT TAATAATGCC CTTATTATTG AAGTTCCAAA 50 

AGAAGATGGA ACGTTAAACT TAACATTAGA AGTTGCACTA CAATTAGGTG 100 

ATGATGTTGT ACGTACAATT GCAATGGATT CAACTGATGG TGTTCAAAGA 150 

GGCATGGATG TTAAAGACAC AGGTAAAGAT ATTAGTGTAC CTGTAGGCGA 200

TGAAACGCTT GGAAGAGTGT TTAATGTACT AGGTGAAACA ATTGACTTGG 250 

AAGAGAAAAT TGATGATTCC GTACGTCGTG ATCCAATCCA TAGACAATCA 300 

CCAGGTTTCG ATGAATTATC TACTGAAGTA GAAATCTTAG AAACAGGTAT 350 

TAAAGTAGTA GACTTATTAG CACCTTACAT TAAAGGTGGT AAAGTTGGAC 400

TATTCGGTGG TGCCGGAGTA GGTAAAACCG TTTTAATCCA AGAATTAATT 450

AACAATATTG CACAAGAACA TGGTGGTATT TCAGTATTCG CGGGTGTAGG 500 

TGAACGTACT CGTGAAGGTA ATGATTTATA CTATGAAATG AGTGATAGTG 550 

GTGTAATTAA GAAAACAGCG ATGGTATTTG GACAAATGAA TGAACCACCT 600 

GGCGCACGTA TGCGTGTAGC TTTATCTGGT TTAACTATGG CTGAATACTT 650 

CCGTGATGAA CAAGGACAAG ACGTACTTTT ATTCATCGAT AATATTTTCA 700

GATTTACACA AGCTGGTTCT GAAGTTTCTG CATTACTTGG TCGTATGCCT 750 

TCAGCCGTTG GTTACCAACC AACATTAGCA ACTGAAATGG GTCAATTACA 800 

AGAACGAATT ACATCTACAA ATAAAGGTTC TGTAACATCT A 841 






2) INFORMATION FOR SEQ ID NO: 378 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 846 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus acidominimus 

(B) STRAIN: ATCC 51726 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378 



TTTAACACGA ATGAACCGCT TCCTGAGATA AATAATGCAC TTGTTGTTTA 50

CAAAGACAGT GAGAAAAAAC ATAAAATCGT TCTTGAAGTA GCTCTTGAAC 100

TTGGTGAAGG CCTCGTTCGT ACCATTGCTA TGGAATCAAC TGATGGTTTG 150

ACACGTGGTC TAGAAGTTCT TGATACAGGC CGTGCAATCA GTGTACCAGT 200

TGGTAAAGAA ACGCTTGGAC GTGTCTTCAA CGTTCTTGGT GATGCTATCG 250

ATCTTGAAGA ACCATTTGGA GAAGATGCAG AACGTCACCC CATTCATAAG 300

AGTGCCCCAA CTTTTGATGA ATTATCAACG TCAACAGAAA TCCTTGAAAC 350

AGGGATTAAA GTTATCGACC TACTTGCCCC TTACTTAAAA GGAGGGAAGG 400

TTGGACTTTT CGGTGGTGCC GGAGTTGGTA AGACCGTTCT TATCCAAGAG 450

TTGATTCATA ACATTGCTCA AGAGCATGGT GGTATTTCAG TATTTACCGG 500

AGTTGGTGAA CGTACACGTG AAGGTAATGA CCTCTATTGG GAAATGAAAG 550

AATCAGGCGT TATTGAAAAA ACAGCTATGG TATTTGGTCA GATGAATGAG 600

CCACCTGGTG CACGTATGCG TGTAGCCCTT ACTGGTTTGA CAATCGCTGA 650

ATATTTCCGT GATGTTGAAG GACAGGACGT GCTTCTCTTT ATTGATAACA 700

TTTTTCGTTT CACACAAGCA GGTTCTGAAG TTTCAGCTCT TCTTGGACGT 750

ATGCCATCAG CCGTTGGTTA TCAACCAACC TTGGCAACTG AAATGGGTCA 800

ATTGCAAGAA CGTATCACGT CAACTAAAAA AGGTTCTGTT ACATCA 846



2) INFORMATION FOR SEQ ID NO: 379

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 846 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12403

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTA CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACA 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400

TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCAA 846 



2) INFORMATION FOR SEQ ID NO: 380 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380



TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 200

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250

TCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 350

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400

TGGACTTTTC GGTGGTGCGG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCAA 846



2) INFORMATION FOR SEQ ID NO: 381 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GTGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCACCATC ATTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACA 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 

TGGACTTTTC GGTGGTGCGG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCA 845



2) INFORMATION FOR SEQ ID NO: 382

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 845 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 27591 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 200

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 

TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CATCA 845 






2) INFORMATION FOR SEQ ID NO: 383 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 845 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: CDC SS1073 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383 



TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400

TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CATCA 845



2) INFORMATION FOR SEQ ID NO: 384 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 845 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus dysgalactiae 

(B) STRAIN: ATCC 43078 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384 

TTGCTAGTGG GGACAAACTT CCAGAGATTA ATAATGCATT GATTGTTTAT 50 

AAAGATAGTG ATAAAAAGCA AAAAATCGTC CTTGAAGTTG CTCTGGAACT 100 

TGGTGACGGT ATGGTGCGAA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGGTT AGAAGTTCTT GACACTGGTC GTGCGATTAG TGTACCAGTA 200 

GGTAAAGAAA CTTTGGGACG CGTCTTTAAT GTACTTGGAG AAACCATTGA 250 

CTTGGAAGAA CCATTTGCAG AAGACGTTGA CCGTCAGCCA ATCCATAAAA 300 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT TCTTGAAACT 350

GGTATCAAGG TAATTGACCT TCTTGCCCCT TACCTTAAAG GTGGTAAAGT 400

TGGACTTTTC GGGGGTGCCG GAGTTGGTAA GACTGTCCTT ATCCAAGAAT 450

TAATCCACAA TATCGCCCAA GAACACGGAG GTATTTCAGT ATTTACCGGT 500 

GTTGGTGAGC GAACACGTGA AGGAAATGAC CTTTACTGGG AAATGAAAGA 550 

ATCAGGCGTT ATTGAGAAAA CTGCCATGGT TTTTGGTCAG ATGAATGAGC 600 

CGCCTGGGGC ACGTATGCGT GTAGCCCTTA CTGGTTTAAC CATTGCTGAG 650 

TATTTCCGTG ATGTAGAAGG CCAAGATGTT TTGCTCTTTA TTGATAATAT 700 

CTTCCGTTTC ACTCAGGCAG GTTCAGAAGT ATCAGCCCTC TTAGGCCGTA 750 

TGCCTTCTGC TGTTGGTTAC CAACCGACCC TTGCTACTGA AATGGGACAA 800 

TTGCAAGAAC GTATTACGTC AACTCAAAAA GGATCTGTTA CTTCT 845 



2) INFORMATION FOR SEQ ID NO: 385 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus equi subsp. equi

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385

TTGCGAGTGG GGACAAACTA CCAGAGATTA ATAATGCGTT GATAGTTTAT 50

AAAGATGGCG ATAAAAAGCA AAAAATCGTT CTCGAGGTTG CCCTAGAGCT 100 

TGGAGACGGT ATGGTACGTA CAATTGCTAT GGAATCAACC GATGGGCTTA 150 

CACGTGGATT AGAGGTTCTT GATACTGGTC GTGCCATTAG TGTACCAGTT 200 

GGTAAAGAGA CTCTAGGTCG TGTTTTCAAC GTTCTTGGTG AAACCATCGA 250 

CCTAGAAGCA CCATTTGCAG ATGATGTTAA TCGTGAACCG ATCCATAAAA 300

AAGCACCAGC CTTTGATGAA TTGTCAACAT CATCAGAAAT TCTTGAAACA 350

GGTATCAAGG TTATTGACCT GCTTGCCCCT TACTTAAAGG GTGGTAAGGT 400

CGGTCTTTTC GGTGGTGCCG GAGTTGGTAA AACCGTTCTT ATCCAAGAAT 450

TAATCCACAA TATCGCTCAA GAGCATGGTG GGATCTCGGT ATTTACCGGT 500

GTTGGTGAGC GTACGCGTGA AGGAAATGAC CTTTACTGGG AAATGAAGGA 550

ATCAGGCGTT ATTGAAAAAA CAGCCATGGT TTTTGGTCAG ATGAATGAAC 600 

CACCAGGAGC CCGTATGCGT GTTGCCTTGA CCGGCTTGAC AATTGCTGAA 650 

TATTTCCGCG ATGTTGAAGG CCAAGACGTC CTGCTCTTCA TTGACAATAT 700 

TTTCCGCTTT ACTCAAGCAG GCTCAGAGGT ATCAGCCCTT CTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAC CAGCCAACAC TTGCCACTGA AATGGGACAA 800

TTGCAAGAGC GTATCACCTC AACGAAAAAA GGCTCTGTTA CCTCTA 846 



2) INFORMATION FOR SEQ ID NO: 386

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 846 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 27335 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386 



TTGCAGCTGG TGATAAATTA CCTGAGATAA ATAATGCACT TGTAGTCTAT 50 

AAAAATGACG AAAATAAATC AAAAATCGTC CTTGAAGTAG CTCTTGAGCT 100 

TGGTGATGGA GTGGTTCGAA CTATTGCCAT GGAATCCACT GATGGGTTGA 150 

CTCGTGGCAT GGAAGTTCTA GATACTGGTC GACCAATTTC TGTTCCAGTT 200 

GGGAAAGAAA CACTTGGTCG CGTCTTTAAC GTTTTAGGCG ATACCATTGA 250

TTTGGATACT CCATTCGGCG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCTCCAAC TTTTGATGAG TTATCTACTT CATCAGAAAT CTTAGAAACA 350 

GGAATAAAGG TTATTGACCT TTTAGCCCCC TACCTCAAAG GTGGGAAAGT 400 

CGGCCTCTTC GGTGGTGCTG GCGTTGGGAA AACTGTCTTG ATTCAAGAGT 450 

TGATTCATAA TATCGCCCAA GAACACGGCG GGATTTCAGT CTTTACTGGT 500

GTTGGGGAAC GAACTCGTGA AGGGAATGAC CTGTACTGGG AAATGAAAGA 550 

ATCTGGTGTT ATCGAAAAGA CGGCTATGGT CTTTGGGCAA ATGAATGAAC 600 

CGCCTGGAGC ACGTATGCGT GTAGCTTTGA CTGGGTTAAC GATTGCAGAG 650 

TATTTCCGTG ATGTGGAAGG TCAAGATGTT CTTTTGTTTA TTGATAATAT 700 

TTTCCGTTTC ACTCAAGCTG GTTCTGAAGT GTCAGCCCTT CTTGGTCGTA 750

TGCCATCAGC TGTTGGTTAC CAACCAACCT TGGCTACTGA AATGGGGCAA 800 

TTACAAGAAC GTATTACATC AACGAAAAAA GGTTCTGTTA CCTCAA 846 



2) INFORMATION FOR SEQ ID NO: 387

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 843 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus salivarius 

(B) STRAIN: ATCC 7073 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387

GCAGCTGGTG ATAAACTTCC TGAGATTAAC AATGCATTGG TCGTTTATAC 50 

TGATGAACAA AAGTCTAAAC GTATCGTGCT CGAAGTAGCT CTTGAACTTG 100 

GAGAAGGTGT GGTTCGTACC ATTGCCATGG AATCTACTGA TGGATTGACT 150 

CGTGGACTAG AAGTTCTGGA CACTGGTCGT CCAATCAGCG TTCCTGTTGG 200

TAAAGATACC CTTGGACGTG TCTTTAACGT TCTTGGTGAT ACCATTGACT 250 

TGGAAGCACC TTTTGCAGAC GATGCAGAGC GTGAACCAAT TCACAAAAAA 300 

GCACCAACTT TCGATGAATT GTCAACATCT ACTGAAATCC TTGAAACAGG 350 

GATTAAAGTT ATCGACTTGC TAGCCCCTTA CCTTAAGGGT GGTAAAGTCG 400 

GACTCTTCGG TGGTGCCGGT GTTGGTAAAA CCGTTCTTAT TCAAGAGTTG 450

ATTCACAACA TTGCCCAAGA GCACGGTGGT ATTTCCGTGT TTACAGGTGT 500 

TGGTGAACGT ACACGTGAAG GTAATGACCT TTACTGGGAA ATGAAAGAAT 550 

CTGGCGTTAT CGAGAAAACA GCCATGGTCT TCGGTCAAAT GAACGAACCA 600 

CCTGGAGCAC GTATGCGTGT TGCCCTTACT GGTTTGACAA TTGCGGAATA 650 

CTTCCGTGAT GTCGAGGGTC AAGACGTTCT TCTCTTCATC GATAACATCT 700

TCCGTTTCAC TCAAGCAGGT TCTGAGGTTT CTGCCCTTCT TGGTCGTATG 750 

CCATCAGCCG TTGGTTACCA ACCTACACTT GCTACTGAAA TGGGTCAATT 800 

GCAAGAACGT ATCACATCAA CTAAAAAAGG TTCTGTTACA TCT 843 






2) INFORMATION FOR SEQ ID NO: 388 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 841 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus suis 

(B) STRAIN: ATCC 43765

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388 



TTGCAGCAGA AGATAAACTT CCTGAGATTA ACAACGCACT CGTTGTATAT 50

AAAAATGATG ATTCCAAACA AAAAGTCGTG CTTGAAGTGG CTTTGGAACT 100

TGGTGATGGC GTTGTACGGA CCATTGCCAT GGAATCAACG GATGGATTGA 150

CACGTGGGAT GGAAGTTCTC GATACAGGTC GTCCCATCTC TGTTCCAGTC 200

GGTAAAGAAA CGCTGGGTCG TGTCTTCAAT GTGTTGGGAG ATACCATTGA 250

CCTTGAAGAG TCTTTTCCGG CAGATTTTGA ACGTGAGCCT ATCCATAAGA 300

AAGCGCCGGC TTTTGACGAA TTATCTACTT CAAGCGAAAT TTTGGAAACA 350

GGGATTAAGG TTATCGACCT CCTAGCACCT TATCTAAAAG GTGGTAAGGT 400

TGGTCTCTTC GGTGGTGCTG GTGTTGGTAA AACCGTTCTT ATCCAAGAAT 450

TGATTCACAA TATTGCCCAA GAACACGGTG GTATCTCTGT ATTTACCGGA 500

GTTGGCGAGC GTACCCGTGA AGGGAACGAT CTTTACTGGG AAATGAAAGA 550

ATCAGGTGTT ATTGAAAAAA CGGCCATGGT ATTTGGTCAG ATGAATGAGC 600

CACCAGGAGC CCGTATGCGT GTTGCTCTTA CTGGTTTGAC TATTGCGGAA 650

TACTTCCGTG ATGTGGAAGG GCAGGATGTT CTTCTGTTCA TCGATAATAT 700

CTTCCGTTTC ACACAGGCTG GTTCAGAAGT GTCTGCCCTC TTGGGTCGTA 750

TGCCATCAGC CGTTGGTTAT CAGCCAACAC TTGCGACGGA GATGGGACAA 800

TTGCAGGAGC GTATTACCTC AACCAAGAAG GGTTCTGTTA C 841



2) INFORMATION FOR SEQ ID NO: 389

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus uberis

(B) STRAIN: ATCC 19436 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389 

GCAAACGGTG AAAAATTACC AGAGATTAAT AATGCATTGA TAGTTTATAA 50 

AGGTAGCGAT AAAAAACAAA AGATTGTTCT TGAAGTTGCT TTGGAACTTG 100

GGGACGGAAT GGTTCGTACA ATCGCTATGG AATCAACTGA TGGGCTTACA 150

CGTGGATTAG AAGTTTTAGA TACTGGCCGT GCCATTAGTG TACCAGTCGG 200 

AAAAGAAACT TTGGGTCGTG TTTTCAATGT GCTTGGTGAA ACCATTGATT 250 

TGGATGAACC ATTTGCCGCT GATGCTGCAA GAGAACCCAT CCATAAAAAA 300

GCCCCAGCAT TTGATGAACT ATCAACGTCT TCAGAAATTC TTGAAACCGG 350 

AATAAAAGTT ATTGACTTAT TAGCCCCTTA TCTCAAAGGT GGTAAAGTTG 400 

GTTTATTTGG TGGTGCCGGA GTAGGTAAAA CGGTTTTAAT TCAAGAATTA 450 

ATTCATAATA TTGCACAAGA ACATGGTGGT ATTTCAGTAT TTACCGGTGT 500 

TGGTGAAAGA ACTCGTGAAG GTAATGACCT TTATTGGGAA ATGAAAGAAT 550 

CTGGCGTTAT TGAAAAAACA GCCATGGTAT TTGGACAAAT GAACGAACCA 600 

CCAGGAGCAC GTATGCGCGT TGCTTTAACA GGTTTAACCA TTGCTGAATA 650 

TTTCCGGGAT GTTGAAGGTC AAGATGTTTT GCTCTTTATT GACAACATTT 700 

TCCGTTTCAC GCAAGCTGGT TCAGAAGTTT CAGCCCTATT GGGTCGTATG 750 

CCTTCAGCGG TAGGATACCA ACCAACACTT GCTACCGAAA TGGGACAATT 800 

GCAAGAAAGA ATTACCTCAA CTAACAAGGG ATCTGTTACT TCTA 844 



2) INFORMATION FOR SEQ ID NO: 390 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 896 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tatumella ptyseos

(B) STRAIN: ATCC 33301



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390 



TTCCCTCAGG ACGCTGTACC ACAGGTGTAC AACGCTCTTG AGGTTGAAAA 50

TGGTGATACC CGTCTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGTGGTG 100

TCGTTCGTAC GATTGCAATG GGAACCTCTG ACGGCCTGAA ACGTGGCCTT 150

AAGGTGACCG ATCTGCAAAA ACCGATTCAG GTACCGGTCG GTAAAGCGAC 200

GCTGGGCCGT ATCATGAACG TACTGGGTCA GCCAATCGAT ATGAAAGGCG 250

ACCTGAAGAA CGAAGATGGT AGCAATGTTG AGGTGAACTC TATTCACCGT 300

GCAGCGCCAA GCTACGAAGA ACTGGCTAAC TCTACTGAGC TGCTGGAAAC 350

GGGTATCAAG GTTATCGACC TGATCTGTCC GTTTGCAAAA GGCGGTAAAG 400

TGGGTCTGTT CGGTGGTGCG GGTGTAGGTA AGACCGTCAA CATGATGGAA 450

CTGATCCGTA ACATCGCTAT CGAGCACTCT GGTTACTCTG TATTTGCAGG 500

GGTGGGTGAG CGTACCCGTG AAGGTAACGA CTTCTACCAC GAAATGACCG 550

AGTCTAACGT TCTGGATAAA GTTGCTCTGG TTTATGGCCA GATGAACGAG 600

CCACCAGGAA ACCGTCTGCG CGTTGCGCTG ACCGGTCTGA CTATGGCTGA 650

AAAATTCCGT GACGAAGGCC GTGACGTACT GCTGTTCGTT GATAACATCT 700

ATCGTTATAC CCTGGCCGGT ACTGAAGTTT CAGCACTGCT GGGTCGTATG 750

CCTTCTGCGG TAGGTTATCA GCCAACACTG GCCGAAGAAA TGGGTGTTCT 800

TCAGGAACGT ATCACGTCAA CCAAAACCGG TTCAATCACT TCCGTA 896



2) INFORMATION FOR SEQ ID NO: 391 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trabulsiella guamensis 

(B) STRAIN: ATCC 49490

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391 

TTCCCTCAGG ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTTATGAA 50 

TGGTAGTGAG CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGTGGTGGTA 100 

TCGTACGTAC TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GATGTAAAAG ATCTCGAGCA TCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

GCTGGGTCGT ATCATGAACG TGCTGGGTCA GCCGATCGAT ATGAAAGGCG 250

ACATCGGCGA AGAAGAGCGT TGGGCTATCC ACCGCGCAGC ACCGTCCTAC 300 

GAAGAGCTGT CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTGAT TCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTGG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTCTGG 550 

ACAAAGTATC CCTGGTGTAT GGACAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG TCTGACCATG GCTGAGAAGT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGATAA CATCTACCGT TACACCCTGG 700 

CGGGTACTGA AGTATCTGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT 750 

TACCAGCCGA CCCTGGCGGA AGAGATGGGT GTTCTTCAGG AACGTATCAC 800 

CTCAACCAAA ACCGGTTCTA TCACCTCCG 829 



2) INFORMATION FOR SEQ ID NO: 392

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia bercovieri 

(B) STRAIN: ATCC 43970 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392

CGAATTCCCC CAAGACGCTG TACCAAAAGT GTACAACGCC CTTGAGGTTG 50

AAGGCACAGC TCAGAAGCTG GTGCTGGAAG TTCAGCAACA GCTGGGCGGT 100

GGTGTTGTTC GTTGTATCGC AATGGGCTCT TCCGATGGTC TGAGCCGCGG 150

GTTGAAAGTC ATCAACCTGG AACACCCAAT TGAAGTGCCG GTGGGTAAAT 200

CAACTCTGGG CCGTATCATG AACGTATTGG GTGACCCAAT CGACATGAAA 250

GGTCCTATCG GTGAAGAAGA GCGTTGGGCA ATCCACCGCG AAGCGCCTTC 300

TTACGAAGAG CTTGCCAGCT CGCAAGATCT GTTAGAAACC GGTATCAAGG 350

TAATGGATCT GATTTGTCCG TTCGCTAAGG GCGGTAAAGT CGGTCTGTTC 400

GGTGGTGCGG GTGTGGGTAA AACAGTCAAC ATGATGGAGC TGATTCGTAA 450

TATTGCGATT GAGCACTCAG GTTATTCTGT ATTTGCCGGT GTGGGTGAGC 500

GTACTCGTGA GGGTAACGAC TTCTACCACG AGATGACTGA CTCCAACGTT 550

CTGGACAAAG TATCCTTGGT TTATGGCCAG ATGAATGAGC CACCAGGTAA 600

CCGTCTGCGC GTTGCACTGA CCGGCTTGAC CATGGCGGAG AAATTCCGTG 650

ATGAAGGTCG TGATGTACTG TTATTCATCG ATAACATCTA TCGTTATACC 700

CTGGCCGGTA CAGAGGTATC TGCACTGCTA GGTCGTATGC CATCAGCGGT 750

AGGCTATCAG CCAACGCTGG CAGAAGAGAT GGGTGTGTTG CAGGAACGTA 800

TCACTTCCAC CAAGACGGGT TCAATCACCT CCGTA 835



2) INFORMATION FOR SEQ ID NO: 393 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 812 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Yersinia enterocolitica

(B) STRAIN: ATCC 9610

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393



GCTGTACCAA AAGTGTACAA CGCCCTTGAG GTTGAAGGCG CAGCTGAGAA 50

GCTGGTGCTG GAAGTTCAGC AACAGCTGGG CGGTGGTGTT GTTCGTTGTA 100

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GTGGGTTGAA AGTCATCAAC 150

CTGGAACACC CAATTGAAGT GCCTGTGGGC AAGTCAACTC TGGGCCGTAT 200

CATGAACGTA TTGGGTGACC CAATCGACAT GAAAGGTCCT ATCGGCGAAG 250

AAGAGCGTTG GGCAATCCAT CGTGAAGCGC CTTCTTACGA AGATCTTGCC 300

AGCTCGCAAG ACTTGTTAGA AACCGGTATC AAGGTAATGG ACTTGATTTG 350

TCCGTTCGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400

GTAAAACGGT AAACATGATG GAGCTTATTC GTAACATTGC GATTGAGCAC 450

TCAGGTTATT CCGTATTTGC TGGCGTGGGT GAGCGTACTC GTGAGGGTAA 500

CGACTTCTAC CACGAGATGA CTGACTCCAA CGTTCTGGAC AAAGTATCCT 550

TGGTTTATGG CCAAATGAAT GAGCCACCAG GTAACCGTCT GCGCGTTGCA 600

CTGACCGGCT TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 650

ATTGCTGTTC ATCGATAACA TCTATCGCTA TACCTTAGCC GGTACGGAAG 700

TTTCCGCACT GCTGGGTCGT ATGCCATCTG CCGTAGGTTA CCAGCCAACG 750

CTGGCAGAAG AGATGGGTGT GTTGCAGGAA CGTATTACTT CCACCAAGAC 800

GGGTTCAATC AC 812



2) INFORMATION FOR SEQ ID NO: 394 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia frederiksenii 

(B) STRAIN: ATCC 33641 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394 

AAAGTGTACA ACGCCCTTGA GGTTGAAGGT ACTGCTGAGA AGTTAGTACT 50 

GGAAGTTCAG CAACAGCTGG GCGGTGGTGT TGCTCGTTGT ATCGCCATGG 100 

GCTCTTCCGA TGGTTTGAGC CGCGGGTTGA AAGTTGTCAA CCTGGAACAC 150 

CCAATTGAAG TACCGGTTGG TAAATCAACT CTGGGCCGTA TCATGAACGT 200 

ATTGGGTGAC CCAATCGACA TGAAAGGTCC TATCGGTGAA GAAGAGCGTT 250 

GGGCAATCCA CCGCGAAGCG CCTTCTTACG AAGAGCTTGC CAGCTCGCAA 300

GATCTGTTAG AAACCGGTAT CAAGGTAATG GATCTGATTT GCCCGTTCGC 350 

TAAAGGCGGT AAAGTCGGTC TGTTCGGTGG TGCGGGTGTA GGTAAAACGG 400 

TAAACATGAT GGAGCTGATC CGTAATATCG CGATCGAGCA CTCAGGTTAT 450

TCCGTATTTG CGGGTGTTGG TGAACGTACC CGTGAGGGTA ACGACTTCTA 500 

CCACGAGATG ACTGACTCCA ACGTTCTGGA CAAAGTATCC TTGGTTTATG 550 

GCCAGATGAA TGAGCCACCA GGTAACCGTC TTCGCGTTGC ACTGACCGGT 600 

CTGACCATGG CGGAGAAATT CCGTGATGAA GGTCGTGACG TATTGCTGTT 650 

CATCGATAAC ATCTATCGTT ATACCTTGGC CGGTACGGAA GTATCCGCAC 700 

TGCTGGGTCG TATGCCATCT GCGGTAGGCT ATCAGCCAAC GCTGGCAGAA 750 

GAGATGGGTG TGTTGCAGGA ACGTATTACT TCCACCAAGA CGGGTTCAAT 800 

CA 802 



2) INFORMATION FOR SEQ ID NO: 395 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia intermedia 

(B) STRAIN: ATCC 29909

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395

GCTGTACCAA GAGTGTACAA CGCCCTTGAG GTTGAAGGCA CTGCTGAGAA 50

GCTGGTGCTG GAAGTTCAGC AACAGCTAGG CGGTGGTGTT GTTCGTTGTA 100

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GCGGGTTGAA AGTCATCAAC 150

CTGGAACACC CAATTGAAGT GCCGGTTGGT AAATCAACTC TGGGCCGTAT 200

CATGAACGTA TTGGGTGACC CAATCGACAT GAAAGGTCCT ATCGGTGAAG 250

AAGAGCGTTG GGCAATCCAC CGCGAAGCGC CTTCTTACGA AGAGCTTGCC 300

AGCTCACAAG ATTTGTTAGA AACCGGTATC AAAGTAATGG ACTTGATTTG 350

CCCGTTCGCT AAGGGCGGTA AAGTGGGTCT GTTCGGTGGT GCGGGTGTAG 400

GTAAAACAGT AAACATGATG GAGCTTATTC GTAACATCGC GATTGAGCAC 450

TCAGGTTATT CTGTATTTGC TGGTGTGGGT GAGCGTACTC GTGAGGGTAA 500

CGACTTCTAC CACGAGATGA CTGACTCCAA CGTTCTGGAC AAAGTATCCT 550

TGGTGTATGG CCAGATGAAT GAGCCACCAG GTAACCGTCT GCGCGTTGCA 600

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 650

ACTGTTGTTC ATCGATAACA TCTATCGCTA TACCTTGGCC GGTACGGAAG 700

TATCCGCACT GCTGGGTCGT ATGCCATCAG CGGTAGGCTA CCAGCCAACG 750

CTGGCAGAAG AGATGGGTGT GTTGCAGGAA CGTATTACGT CCACCAAGAC 800

GGGTTC 806



2) INFORMATION FOR SEQ ID NO: 396

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 806 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Yersinia pseudotuberculosis

(B) STRAIN: ATCC 29833

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396

GCTGTACCAA AAGTGTACAA CGCCCTTGAG GTAGAAGGCA CAACTGAAAA 50

GTTAGTGCTG GAAGTTCAGC AACAGTTGGG CGGTGGTGTT GTTCGTTGTA 100

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GTGGGTTGAA AGTAACCAAC 150

CTGGAACACC CGATCGAAGT ACCGGTTGGT AAAGCGACCC TTGGCCGTAT 200

CATGAACGTA TTGGGTGAAC CAATCGACAT GAAAGGTCCT ATCGGTGAAG 250

AAGAGCGTTG GGCAATCCAT CGCGAAGCGC CTTCTTATGA AGAGCTTGCT 300

AGCTCACAAG ATCTGTTAGA AACCGGTATC AAGGTTATGG ACCTGATTTG 350

TCCGTTTGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400

GTAAAACAGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450

TCTGGGTATT CTGTATTTGC CGGTGTAGGT GAGCGTACCC GTGAGGGTAA 500

TGACTTCTAC CATGAAATGA CTGACTCCAA CGTTTTGGAC AAAGTATCCT 550

TGGTTTACGG CCAGATGAAT GAGCCACCAG GTAACCGTCT ACGCGTTGCA 600

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 650

ACTGCTGTTC ATCGATAATA TCTATCGTTA TACCCTAGCT GGTACGGAAG 700

TATCCGCATT GCTGGGTCGT ATGCCATCAG CGGTAGGTTA TCAGCCAACA 750

CTGGCTGAAG AGATGGGTGT GTTGCAGGAA CGTATTACTT CCACTAAGAC 800

GGGTTC 806



2) INFORMATION FOR SEQ ID NO: 397

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia rohdei 

(B) STRAIN: ATCC 43380 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397

TTCCCCCAAG ACGCTGTACC AAAAGTGTAC AACGCCCTTG AGGTTGAAGG 50 

TGCAGCTGAG AAGCTTGTGC TGGAAGTTCA GCAGCAGCTG GGCGGTGGTG 100 

TTGTTCGTTG TATCGCAATG GGCTCTTCCG ATGGTTTGAG CCGTGGGTTG 150 

AAAGTTATCA ACCTGGAACA CCCAATTGAA GTGCCAGTTG GTAAATCAAC 200

TCTGGGCCGT ATCATGAACG TATTGGGTGA CCCAATCGAC ATGAAAGGCC 250 

CTATCGGTGA AGAAGAGCGT TGGGCAATCC ACCGTGAAGC GCCTTCTTAC 300 

GAAGAGCTTG CCAGCTCGCA AGATCTGTTA GAAACCGGTA TCAAGGTAAT 350 

GGATCTGATT TGTCCGTTCG CTAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400

GTGCGGGTGT TGGTAAAACA GTAAACATGA TGGAGCTTAT TCGTAACATC 450

GCGATTGAGC ACTCAGGTTA TTCTGTATTT GCCGGGGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAGAT GACTGACTCC AACGTTCTGG 550 

ACAAAGTATC CTTGGTTTAT GGCCAGATGA ATGAGCCACC AGGTAACCGT 600 

CTGCGCGTTG CACTGACCGG CTTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGCCGTGAC GTATTGCTGT TCATCGATAA CATTTATCGT TATACCCTAG 700

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCATC TGCGGTAGGC 750 

TATCAGCCAA CACTGGCAGA AGAGATGGGT GTGTTGCAGG AACGTATTAC 800 

TTCCACTAAG ACGGGTTCAA TCACCTCCG 829 



2) INFORMATION FOR SEQ ID NO: 398

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 819 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yokenella regensburgei 

(B) STRAIN: ATCC 35313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398

ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTACAAAA TGGTAACGAG 50

AAACTGGTGC TGGAAGCTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100

TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG GAAGTTAAAG 150

ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC CCTGGGCCGT 200

ATCATGAACG TCCTGGGTCA GCCGATCGAC ATGAAAGGCG ACATCGGTGA 250

AGAAGAGCGT TGGGCTATCC ACCGCGCAGC ACCTTCCTAT GAAGAGCTGT 300

CCAGCTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTAAT GGATCTGATC 350

TGCCCGTTCG CTAAGGGTGG TAAAGTCGGT CTGTTCGGTG GTGCGGGTGT 400

AGGTAAAACT GTAAACATGA TGGAGCTTAT CCGTAACATC GCGATTGAGC 450

ACTCCGGTTA CTCTGTGTTT GCAGGCGTGG GTGAACGTAC TCGTGAGGGT 500

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTACTGG ATAAAGTATC 550

CCTGGTGTAC GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600

CGCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGCCGTGAC 650

GTTCTGCTGT TCGTCGATAA CATCTACCGT TATACCCTGG CCGGTACGGA 700

AGTATCCGCA CTGCTGGGTC GTATGCCTTC TGCGGTAGGT TATCAGCCAA 750

CTCTGGCGGA AGAGATGGGT GTTCTTCAGG AACGTATCAC CTCTACCAAA 800

ACCGGTTCTA TCACCTCCG 819



2) INFORMATION FOR SEQ ID NO: 399 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1097 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Yarrowia lipolytica

(B) STRAIN: ATCC 38295

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399

AAGCTTAAGG CTGAGCGAGA GCGAGGTATC ACCATTGATA TCGCTCTCTG 50

GAAGTTCCAG ACCCCTAAGT ACTACGTCAC CGTTATTGAT GCTCCCGGTC 100

ACCGAGATTT CATCAAGAAC ATGATTACCG GTACTTCCCA GGCTGACTGC 150 

GCCATCCTCA TCATTGCTGG TGGTGTTGGT GAGTTCGAGG CTGGTATCTC 200 

CAAGGATGGT CAGACCCGAG AGCACGCCCT GCTCGCTTTC ACCCTCGGTG 250 

TTAAGCAGCT CATTGTTGCT ATCAACAAGA TGGACTCCGT CAAGTGGTCT 300 

CAGGATCGAT ACCTCGAGAT TTGCAAGGAG ACTGCCAACT TCGTCAAGAA 350

GGTCGGTTAC AACCCCAAGG CTGTCCCCTT CGTCCCCATT TCCGGATGGA 400

ACGGTGACAA CATGATCGAG CCCTCTACCA ACTGTGACTG GTACAAGGGA 450 

TGGACCAAGG AGACCAAGGC CGGCGAGATC AAGGGTAAGA CCCTCCTCGA 500 

GGCCATTGAT GCCATTGAGC CCCCCGTGCG ACCCCACGAC AAGCCCCTCC 550 

GACTTCCCCT CCAGGATGTC TACAAGATCG GTGGTATCGG CACAGTGCCC 600

GTTGGCCGAG TCGAGACCGG TGTTATCAAG GCCGGTATGG KTGTTACCTT 650 

CGCTCCCGCC AACGTGACCA CTGAGGTCAA GTCTGTCGAG ATGCACCACG 700 

AGATCCTCCC CGATGGAGGT TTCCCCGGTG ACAACGTCGG TTTCAACGTC 750 

AAGAACGTTT CCGTCAAGGA TATCCGACGA GGTAACGTTG CTGGTGACTC 800 

CAAGAACGAC CCCCCCAAGG GCTGCGACTC TTTCAACGCT CAGGTCATTG 850

TTCTTAACCA CCCCGGTCAG ATCGGTGCTG GTTACGCTCC CGTCCTTGAT 900 

TGCCACACTG CCCACATTGC TTGCAAGTTC GACACCCTGA TCGAGAAGAT 950 

CGACCGACGA ACCGGTAAGA AGATGGAGGA CTCCCCCAAG TTCATCAAGT 1000 

CTGGTGATGC TGCCATTGTC AAGATGGTTC CCTCCAAGCC CATGTGTGTC 1050

GAGGCCTTCA CTGAGTACCC CCCTCTTGGT CGATTCGCCG TCCGAGA 1097



2) INFORMATION FOR SEQ ID NO: 400

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1233 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Absidia corymbifera

(B) STRAIN: ATCC 46775

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400

CAAGCTTAAG GCTGAACGTG AGCGTGGTAT CACCATCGAT ATCGCTCTCT 50

GGAAGTTCGA GACTCCCAAG TACCACGTTA CCGTCATTGA TGCCCCTGGC 100 

CATCGTGATT TCATCAAGAA CATGATTACT GGTACTTCCC AAGCTGACTG 150 

CGGTATCTTG ATTATTGCTG CTGGTACTGG TGAATTCGAA GCTGGTATCT 200

CCAAGGATGG TCAAACCCGT GAACACGCTT TGCTTGCTTT CACCCTTGGT 250

GTCCGTCAAT TGATTGTCGC TATCAACAAG ATGGATTCCA CCAAGTACTC 300

TGAGGCCCGT TACAACGAAA TTGTCAAGGA AGTCTCCACC TTCATCAAGA 350 

AGATTGGTTT CAACCCCAAG TCCGTTCCTT TCGTCCCTAT CTCTGGCTGG 400 

AACGGTGACA ACATGTTGGA GGARTCCACC AACATGCCTT GGTTCAAGGG 450 

ATGGAACAAG GAGACTAAGG CTGGTGCCAA GACYGGCAAG ACCCTTCTTG 500 

AAGCCATTGA CAACATTGAT CCCCCTGTTC GTCCTTCCGA CAAGCCCCTT 550

CGTCTTCCCC TTCAAGATGT CTACAAGATC GGTGGTATTG GTACAGTTCC 600

TGTCGGTCGT GTTGAGACTG GTGTCATCAA GCCTGGTATG GTTGTCACCT 650

TCGCTCCCGC TAACGTCACC ACTGAAGTCA AGTCCGTYGA AATGCACCAC 700 

GAGCAACTTG CTGAAGGTGT TCCCGGTGAC AACGTCGGTT TCAACGTCAA 750 

GAACGTTTCC GTCAAGGATA TCCGCCGTGG TAACGTYTGC TCTGACTCCA 800

AGAACGACCC CGCCAAGGAA TCCGCTTCCT TCACCGCTCA AGTTATTGTC 850 

TTGAACCACC CTGGTCARAT TGGTGCTGGT TACTCTCCTG TCTTGGATTG 900 

CCACACTGCT CACATTGCAT GCAAGTTCTY TKAGCTTCTT KAGAAGATCG 950 

ATYGTCGTTC CGGTAAGTAA ATANTTTGGT TTRGGATATG GGTATTGGGC 1000 

TTAATCTYTG GATTTTGCCT CAATTGCTCC TTCCTTGATC TTTCTCGATT 1050

ACTTTTTGAT CATTTGCTAA TCCAAACCCT TTCCATTTYA TTGAAAACAG 1100 

GTAAGAAGTT GGAAGACTCC CCCAAGTTCG TCAAGWSYGG TGACTCTGCY 1150 

ATCGTCAAGA TGGTTCCTTC CAAGCCCATG TGCGTTGAAG CCTACACTGA 1200

ATATCCTCCT CTTGGTCGTT TCGCTGTCCG TGA 1233



2) INFORMATION FOR SEQ ID NO: 401 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1151 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Alternaria alternata

(B) STRAIN: ATCC 62099

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401

CAAGTTGAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCTCTCT 50

GGAAGTTCGA GACTCCCAAG GTTAGTACCC CTCTGCCTAC TACATCAAGT 100

TCTTTACAAT GCTAACATGT TGTACTCAGT ACTATGTCAC CGTCATTGAC 150 

GCCCCCGGTC ACCGTGATTT CATCAAGAAC ATGATCACTG GTACCTCCCA 200 

GGCCGACTGC GCTATTCTCA TCATTGCCGC CGGTACTGGT GAGTTCGAGG 250 

CTGGTATCTC CAAGGATGGC CAGACTCGTG AGCACGCTCT CCTCGCTTAC 300 

ACCCTCGGTG TCAAGCAGCT CATCGTTGCC ATCAACAAGA TGGACACCAC 350

CAAGTGGTCC GAGGAGCGTT ACCAGGAGAT CATCAAGGAG ACCTCCAACT 400

TCATCAAGAA GGTCGGCTAC AACCCCAAGC ACGTTCCCTT CGTCCCCATC 450

TCCGGTTTCA ACGGTGACAA CATGATTGAG GCCTCATCCA ACTGCCCCTG 500

GTACAAGGGT TGGGAGAAGG AGACCAAGGC CAAGGCCACT GGTAAGACCC 550

TCCTCGAGGC CATCGACGCC ATCGACCCTY CCAGCCGTCC CACCGACAAG 600

CCCCTCCGTC TTCCCCTYCA GGATGTTTAC AAGATTGGTG GTATTGGCAC 650

GGTGCCCGTC GGTCGTGTCG AGACCGGTAT CATCAAGGCC GGTATGGTCG 700

TCACCTTCGC CCCCGCTGGT GTCACCACTG AAGTCAAGTC CGTCGAGATG 750

CACCACGAGC AGCTCACCGA GGGTGTCCCC GGTGACAACG TCGGCTTCAA 800

CGTCAAGAAC GTCTCCGTCA AGGAGATCCG TCGTGGTAAC GTTGCCGGTG 850

ACTCCAAGAA CGACCCCCCC AAGGGTGCCG AGTCCTTCAA CGCCCAGGTC 900

ATCGTCCTCA ACCACCCTGG TCAGGTCGGT GCTGGTTACG CCCCAGTCCT 950

CGACTGCCAC ACCGCCCACA TTGCTTGCAA GTTCTCTGAG CTCCTCGAGA 1000

AGATTGACCG CCGTACCGGA AAGTCTGTTG AGAACTCTCC CAAGTTCATC 1050

AAGTCCGGTG ACGCCGCCAT CGTCAAGATG GTTCCCTCCA AGCCCATGTG 1100

CGTTGAGGCT TTCACTGACT ACCCTCCTCT CGGTCGTTTC GCTGTCCGTG 1150

A 1151



2) INFORMATION FOR SEQ ID NO: 402 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(B) STRAIN: ATCC 26947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402 



CAAGCTCAAG TCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT 50

GGAAGTTCCA GACCTCCAAG TATGAGGTCA CCGTCATTGG TAAGCATTTG 100

AGTTCCAACC TACGTTGCCC AACATTTACA GTCATCTAAC AAAGTTCAAT 150

AGATGCCCCC GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACTT 200

CCCAGGCTGA CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAATTC 250

GAGGCTGGTA TCTCCAAGGA TGGTCAGACC CGTGAGCACG CTCTGCTCGC 300

TTTCACCCTC GGTGTCCGTC AGCTCATCGT TGCCCTCAAC AAGATGGACA 350

CCTGCAAGTG GTCTCAGGAT CGTTACAACG AAATCGTTAA GGAGACTTCC 400

AACTTCATCA AGAAGGTCGG ATACAACCCC AAGAGCGTTC CTTTCGTCCC 450

CATCTCCGGT TTCAACGGTG ACAACATGAT TGAGGCCTCC ACCAACTGCC 500

CCTGGTACAA GGGTTGGGAG AAGGAGACCA AGGCTGGCAA GTCCACCGGT 550

AAGACCCTTC TCGAGGCCAT CGATGCCATC GAGCCCCCCG TCCGTCCCAC 600

CGACAAGCCT CTCCGTCTTC CCCTYCAGGA TGTCTACAAG ATCTCTGGTA 650

TCGGTACTGT GCCCGTCGGT CGTGTCGAGA CTGGTGTCAT CAAGCCTGGT 700

ATGGTCGTTA CTTTCGCTCC TGCCAACGTG ACCACTGAAG TCAAGTCCGT 750

TGAAATGCAC CACCAGCAGC TCCAGGCCGG TAACCCCGGT GACAACGTTG 800

GTTTCAACGT CAAGAACGTC TCCGTCAAGG AAGTCCGCCG TGGTAACGTT 850

GCCGGTGACT CCAAGAACGA CCCCCCTGCT GGCTGCGATT CCTTCAACGC 900

CCAGGTCATC GTCCTTAACC ACCCCGGTCA GGTCGGCAAC GGTTACGCTC 950

CCGTCCTGGA CTGCCACACC GCTCACATTG CTTGCAAGTT CGCTGAGCTC 1000

CTTGAGAAGA TTGACCGCCG TACCGGTAAA TCTGTTGAGG ACAAGCCCAA 1050

GTTCATCAAG TCTGGTGATG CTGCCATCGT CAAGATGATT CCCTCCAAGC 1100

CCATGTGTGT GGAGTCTTTC ACTGACTTCC CCCCTCTTGG TCGTTTCGCT 1150

GTCCGTGACG TAAGTTTTTC CCTCTTGACT ATCTTCACAA TTTTTCACAT 1200

ATTTTCACGC CTCGTCCCAC TCTTTTTCCT CCCTTCCTCT TTGGTTCCCC 1250

TTTTTGCCTG CAAGTTCTCT ATAGCTAACA TGA 1283



2) INFORMATION FOR SEQ ID NO: 403 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1103 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: DAL95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403 



TCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT GGAAGTTCCA 50

GACTCCCAAG TATGAGGTCA CTGTCATCGG TAAGCTCGAC TCGCCCCGAT 100

ATGTTTTGGT GCTGTAGCTA ACACGATCTG AAGATGCCCC CGGTCACCGT 150

GACTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCTG ACTGCGCTAT 200

CCTCATCATT GCCTCCGGTA CTGGTGAGTT CGAGGCTGGT ATCTCCAAGG 250

ATGGCCAGAC CCGTGAGCAC GCTCTGCTGG CTTTCACCCT CGGTGTCAAG 300

CAGCTCATCG TCGCCCTCAA CAAGATGGAC ACCTGCAAGT GGTCCGAGGA 350

TCGTTACAAC GAAATTGTCA AGGAAACCTC CAACTTCATC AAGAAGGTCG 400

GCTACAACCC CAAGGCCGTT CCCTTCGTCC CCATCTCTGG CTTCAACGGT 450

GACAACATGC TTGAGCCCTC CTCCAACTGC CCCTGGTACA AGGGATGGGA 500

GAAGGAGACC AAGGCCGGCA AGGTCACTGG TAAGACCCTC ATCGAGGCCA 550

TCGACGCCAT TGAGCCCCCT GTCCGTCCCT CCAACAAGCC CCTCCGTCTT 600

CCCCTCCAGG ATGTCTACAA GATCTCTGGT ATCGGAACGG TCCCTGTCGG 650

CCGTGTCGAG ACCGGTATCA TCAAGCCCGG CATGGTCGTC ACCTTCGCCC 700

CCGCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA CCACCAGCAG 750

CTCCAGGAGG GTGTCCCCGG TGACAACGTC GGTTTCAACG TCAAGAACGT 800

TTCCGTCAAG GAAGTCCGCC GTGGTAACGT CTGCGGTGAC TCCAAGAACG 850

ATCCCCCTCA GGGTGCTGCC TCCTTCAACG CCCAGGTCAT CGTCCTCAAC 900

CACCCCGGTC AGGTCGGCGC TGGTTACGCC CCCGTCCTCG ACTGCCACAC 950

TGCCCACATT GCTTGCAAGT TCTCTGAGCT GCTTGAGAAG ATTGACCGCC 1000

GTACCGGCAA GTCTGTTGAG AACAACCCCA AGTTCATCAA GTCCGGTGAT 1050

GCCGCCATCG TGAAGATGGT TCCTTCCAAG CCCATGTGTG TCGAGTCCTT 1100

CAC 1103



2) INFORMATION FOR SEQ ID NO: 404 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1149 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: WSA-172 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404

AAGCTCAAGT CCGAGCGTGA GCGTGGTATC ACCATCGACA TTGCCCTCTG 50 

GAAGTTCCAG ACTCCCAAGT ATGAGGTCAC TGTCATCGGT AAGCTCGACT 100 

CGCCCCGATA TGTTTTGGTG CTGTAGCTAA CACGATCTGA AGATGCCCCC 150 

GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 200 

CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA 250 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTGCTGGC TTTCACCCTC 300 

GGTGTCAAGC AGCTCATCGT CGCCCTCAAC AAGATGGACA CCTGCAAGTG 350

GTCCGAGGAT CGTTACAACG AAATTGTCAA GGAAACCTCC AACTTCATCA 400

AGAAGGTCGG CTACAACCCC AAGGCCGTTC CCTTCGTCCC CATCTCTGGC 450 

TTCAACGGTG ACAACATGCT TGAGCCCTCC TCCAACTGCC CCTGGTACAA 500 

GGGATGGGAG AAGGAGACCA AGGCCGGCAA GGTCACTGGT AAGACCCTCA 550 

TCGAGGCCAT CGACGCCATT GAGCCCCCTG TCCGTCCCTC CAACAAGCCC 600 

CTCCGTCTTC CCCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGAACGGT 650 

CCCTGTCGGC CGTGTCGAGA CCGGTATCAT CAAGCCCGGC ATGGTCGTCA 700 

CCTTCGCCCC CGCCAACGTC ACCACTGAAG TCAAGTCCGT CGAGATGCAC 750 

CACCAGCAGC TCCAGGAGGG TGTCCCCGGT GACAACGTCG GTTTCAACGT 800 

CAAGAACGTT TCCGTCAAGG AAGTCCGCCG TGGTAACGTC TGCGGTGACT 850 

CCAAGAACGA TCCCCCTCAG GGTGCTGCCT CCTTCAACGC CCAGGTCATC 900 

GTCCTCAACC ACCCCGGTCA GGTCGGCGCT GGTTACGCCC CCGTCCTCGA 950 

CTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTG CTTGAGAAGA 1000 

TTGACCGCCG TACCGGCAAG TCTGTTGAGA ACAACCCCAA GTTCATCAAG 1050 

TCCGGTGATG CCGCCATCGT GAAGATGGTT CCTTCCAAGC CCATGTGTGT 1100 

CGAGTCCTTC ACTGACTACC CCCCTCTGGG TCGTTTCGCC GTCCGTGAC 1149 



2) INFORMATION FOR SEQ ID NO: 405 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1151 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus niger 

(B) STRAIN: ATCC 9508 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405 

CAAGCTCAAG TCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT 50 

GGAAGTTCCA GACTGGCAAG TATGAGGTCA CCGTCATTGG TATGTACTCA 100 

CAGAGTTCTC TTTTCATCAA AGCAATATAC TAACGTCCAT CATAGACGCC 150 

CCCGGTCACC GTGACTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC 200 

TGACTGCGCT ATCCTCATCA TTGCCTCCGG TACTGGTGAG TTCGAGGCTG 250 

GTATCTCCAA GGATGGCCAG ACTCGTGAGC ACGCTCTGCT TGCTTTCACC 300 

CTCGGTGTCC GCCAGCTCAT CGTTGCCCTC AACAAGATGG ACACCTGCAA 350 

GTGGTCCGAG GACCGTTACA ACGAAATCGT TAAGGAGACC TCCAACTTCA 400 

TCAAGAAGGT CGGATACAAC CCCAAGGGTG TTCCTTTCGT CCCCATCTCC 450 

GGTTTCAACG GTGACAACAT GCTCGAGCCC TCCCCCAACT GCCCCTGGTA 500 

CAAGGGTTGG GAGAAGGAGA CCAAGGCCGG CAAGGTCACC GGTAAGACCC 550 

TCCTTGAGGC CATCGACGCC ATCGAGCCCC CCGTCCGTCC CTCCAACAAG 600 

CCCCTCCGTC TTCCCCTCCA GGATGTCTAC AAGATCTCCG GTATTGGAAC 650 

TGTTCCCGTC GGTCGTGTCG AGACCGGTAT CATTGCCCCT GGTATGGTCG 700 

TGACCTTCGC TCCCGCCAAC GTCACCACTG AAGTCAAGTC CGTTGAGATG 750 

CACCACCAGC AGCTCAAGGA AGGTGTCCCC GGTGACAACG TTGGTTTCAA 800

CGTCAAGAAC GTTTCCGTCA AGGAGGTTCG CCGTGGTAAC GTTGCCGGTG 850

ACTCCAAGAA CGACCCCCCT CTTGGCTGTG AGAGCTTCAC CGCCCAGGTC 900

ATCGTCCTCA ACCACCCCGG TCAGGTCGGC GCTGGTTACG CTCCCGTCCT 950

GGACTGCCAC ACTGCTCACA TTGCTTGCAA GTTCGCTGAG CTCCTTGAGA 1000

AGATTGACCG CCGTACCGGA AAGTCTGTTG AATCTTCCCC CAAGTTCATC 1050

AAGTCCGGTG ACGCTGCCAT CGTCAAGATG ATTCCCTCCA AGCCCATGTG 1100

TGTTGAGGCT TTCACTGACT ACCCCCCTCT TGGTCGTTTC GCCGTCCGCG 1150

A 1151



2) INFORMATION FOR SEQ ID NO: 406 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1093 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 

(B) STRAIN: ATCC 10663

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406 

GCTTAAAGCT GAACGTGAAC GTGGTATCAC CATTGATATC GCTCTCTGGA 50 

AGTTCGAAAC TCCTAAGTAC TACGTTACTG TTATTGATGC TCCAGGTCAC 100

CGTGATTTCA TCAAGAACAT GATTACTGGT ACTTCCCAAG CCGATTGCGC 150

CATTCTTATC ATTGCTGCCG GTGTCGGTGA ATTCGAAGCT GGTATCTCCA 200

AGGAAGGTCA AACCAGAGAA CACGCTCTTC TCGCTTTCAC CCTTGGTGTC 250 

AGACAACTTA TCATTGCCAT CAACAAGATG GACTCTGTCA AGTGGGACCA 300 

AAAGAGATAC GAAGAAATCG TCAAGGAGGC TTCCAACTTC GTCAAGAAGG 350

TTGGTTACAA CCCCAAGTCT GTTCCATTCG TTCCTATCTC TGGTTGGAAC 400

GGTGACAACA TGTTGGAACC TACCACCAAC GCCCCATGGT ACAAGGGATG 450 

GACCAAGGAA ACCAAGGCTG GTGCCACTAA GGGTATGACT CTTATTGAAG 500 

CCATTGACGC CATTGAACCA CCAGTAAGAC CATCCGACAA GCCACTCCGT 550 

CTCCCACTCC AAGATGTTTA CAAGATTGGT GGTATCGGAA CTGTGCCAGT 600

CGGCCGTGTC GAAACCGGTA TCATCAAGGC CGGTATGGTC GTTACCTTTG 650 

CTCCACCAAT GGTCACAACT GAAGTTAAGT CCGTTGAAAT GCACCACGAA 700 

CAACTTGCTC AAGGTAACCC AGGTGACAAC GTTGGTTTCA ACGTCAAGAA 750 

CGTTTCCGTT AAGGAAATCA GACGTGGTAA CGTCTGTGGT GACTCCAAGA 800 

ACGATCCACC AAAGGGCTGC GAATCTTTCA ACGCTCAAGT TATCGTCTTG 850

AACCACCCTG GTCAAATCTC TGCTGGTTAC TCTCCAGTTC TCGATTGCCA 900 

CACTGCCCAC ATTGCCTGCA GATTCGACGA ACTCCTTGAA AAGATCGACC 950 

GTCGTTCTGG TAAGAAGATT GAAGACTCTC CAAAGTTTGT CAAGTCTGGT 1000 

GATGCCGCTA TCGTCAAGAT GATCCCAACC AAGCCAATGT GCGTTGAAAC 1050 

CTTCACTGAA TACCCACCAC TTGGTCGTTT CGCCGTCCGT GAT 1093



2) INFORMATION FOR SEQ ID NO: 407 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Candida albicans

(B) STRAIN: ATCC 10231

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407

CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 350

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTYGT TCCAATCTCT 400

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750

CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

A 1101 



2) INFORMATION FOR SEQ ID NO: 408 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1089 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408 



GAAGGCTGAA AGAGAAAGAG GTATCACCAT TGATATCGCT TTGTGGAAAT 50

TCGAAACTCC AAAATACCAG GTTACCGTCA TTGATGCTCC AGGTCACAGA 100

GATTTCATCA AGAATATGAT CACTGGTACT TCTCAAGCTG ATTGTGCTAT 150

TTTGATTATT GCTGGTGGTA CTGGTGAATT CGAAGCCGGT ATTTCTAAGG 200

ATGGTCAAAC CAGAGAACAC GCTTTGTTGG CTTACACTTT GGGTGTCAAA 250

CAATTGATTG TTGCTGTCAA CAAGATGGAC TCTGTCAAAT GGGACAAAAA 300

CAGATTTGAA GAAATCATCA AGGAAACCTC CAACTTCGTC AAGAAGGTTG 350

GTTACAACCC AAAGACTGTT CCATTCGTTC CAATCTCTGG TTGGAATGGT 400

GACAACATGA TTGAACCATC CACCAACTGT CCATGGTACA AGGGTTGGGA 450

AAAGGAAACC AAATCCGGTA AAGTTACTGG TAAGACCTTG TTAGAAGCTA 500

TTGACGCTAT TGAACCACCA ACCAGACCAA CCGACAAACC ATTGAGATTG 550

CCATTGCAAG ATGTTTACAA GATCGGTGGT ATTGGTACTG TGCCAGTCGG 600

TAGAGTTGAA ACTGGTATCA TCAAAGCCGG TATGGTTGTT ACTTTCGCCC 650

CAGCTGGTGT TACCACTGAA GTCAAATCCG TTGAAATGCA TCACGAACAA 700

TTGGCTGAAG GTGTTCCAGG TGACAATGTT GGTTTCAACG TTAAGAACGT 750

TTCCGTTAAA GAAATTAGAA GAGGTAACGT TTGTGGTGAC TCCAAGAACG 800

ATCCACCAAA GGGTTGTGAC TCTTTCAATG CCCAAGTCAT TGTTTTGAAC 850

CATCCAGGTC AAATCTCTGC TGGTTACTCT CCAGTCTTGG ATTGTCACAC 900

TGCCCACATT GCTTGTAAAT TCGACACTTT GGTTGAAAAG ATTGACAGAA 950

GAACTGGTAA GAAATTGGAA GAAAATCCAA AATTCGTCAA ATCCGGTGAT 1000

GCTGCTATCG TCAAGATGGT CCCAACCAAA CCAATGTGTG TTGAAGCTTT 1050

CACTGACTAC CCACCATTAG GTAGATTCGC TGTCAGAGA 1089



2) INFORMATION FOR SEQ ID NO: 409

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 56884 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 409 



CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 350

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 450

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500

TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600

TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750

CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000

AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100

A 1101






2) INFORMATION FOR SEQ ID NO: 410 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 60193 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410 



CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATrG 50

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA [illegible] 150

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCHG 200

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT [illegible] 250

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 350

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 450

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500

TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600

TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750

CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000

AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100

AT 1102



2) INFORMATION FOR SEQ ID NO: 411 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 90028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411



CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 350

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 450

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500

TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600

TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750

CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000

AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100

AT 1102



2) INFORMATION FOR SEQ ID NO: 412 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 
(B) STRAIN: NCPF 3108

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412 

CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCTAACTTCG 350

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400

GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850

ATTGTCTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

AAATCCGGTG ACGCTGCTAT YGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100

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1101 



2) INFORMATION FOR SEQ ID NO: 413 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata 

(B) STRAIN: ATCC 10565 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413 

GGACAAGCTT AAGGCTGAGC GTGAGAGAGG TATCACCATT GACATTGCCT 50 

TGTGGAAGTT CGAGACTCCC AAGTACCACG TCACTGTCAT TGACGCCCCC 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACCT CGCAGGCTGA 150 

CTGTGCTATC TTGATCATTG CTTCCGGTGT CGGTGAGTTC GAGGCTGGTA 200

TCTCCAAGGA CGGTCAGACC CGTGAGCACG CCTTGTTGGC CTACACCTTG 250 

GGTGTCAAGC AGTTGATCGT TGCCATCAAC AAGATGGACT CCGTCAAGTG 300 

GGACAAGAAC AGATTCGAGG AGATTGTCAA GGAGACCACC AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCC AAGGCTGTCC CCTTCGTCCC CATCTCTGGC 400

TGGAACGGTG ACAACATGAT TGAGGCCTCC ACCAACTGCC CCTGGTACAA 450

GGGCTGGGAG AAGGAGACCA AGGCCGGTAA GTCTACCGGT AAGACCTTGT 500 

TGGAGGCCAT TGACGCCATT GAGCCCCCTA CCAGACCCAC CGACAAGCCC 550 

TTGAGATTGC CCTTGCAGGA TGTCTACAAG ATCGGTGGTA TTGGTACGGT 600 

GCCCGTCGGC CGTGTCGAGA CCGGTGTCAT CAAGCCCGGT ATGGTCGTCA 650 

CCTTCGCCCC CGCTGGTGTC ACCACTGAAG TCAAGTCCGT CGAGATGCAC 700 

CACGAGCAGT TGTCCGAGGG TGTCCCCGGT GACAACGTTG GTTTCAACGT 750 

CAAGAACGTC TCTGTTAAGG AGATCAGACG TGGTAACGTC TGCGGTGACT 800 

CCAAGAACGA CCCCCCCATG GGTTGCTCTT CTTTCAACGC CCAGGTTATC 850 

GTGTTGAACC ACCCCGGTCA GATCTCTGCC GGTTACTCTC CCGTCTTGGA 900 

CTGCCACACC GCCCACATTG CTTGCAAGTT CGCCGAGTTG ATCGAGAAGA 950 

TTGACAGACG TACCGGTAAG TCCATGGAGG CTAACCCCAA GTTCGTCAAG 1000 

TCTGGTGACG CCGCCATCGT CAAGATGGAG CCCACTAAGC CCATGTGTGT 1050 

TGAGGCCTTC ACTGACTTCC CTCCTTTGGG TAGATTCGCC GTCAGAGA 1098 



2) INFORMATION FOR SEQ ID NO: 414 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1102 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3949 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414 

CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300

ATGGGACAAA AACAGATTCG AAGAAATCAT CAAGGAAACC TCTAACTTCG 350

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400

GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650

TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700

CATCACGAAC AATTGGTTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850

ATTGTCTTGA ACCATCCAGG TCAAATCTCT GCTGGTTAYT CTCCAGTCTT 900

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000

AAATCCGGTG ACGCTGCTAT YGTCAAGATG GTCCCAACCA AACCAATGTG 1050

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100

AT 1102



2) INFORMATION FOR SEQ ID NO: 415 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 
(B) STRAIN: CBS 7987

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415 

CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

ATGGGACAAA AACAGATTYG AAGAAATCAT CAAGGAAACC TCTAACTTCG 350

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGTTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850

ATTGTCTTGA ACCATCCAGG TCAAATCTCT GCTGGTTAYT CTCCAGTCTT 900

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000

AAATCCGGTG ACGCTGCTAT YGTCAAGATG GTCCCAACCA AACCAATGTG 1050

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100

AT 1102



2) INFORMATION FOR SEQ ID NO: 416 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 62894 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416 

AATTGAAGGC TGAAAGAGAA AGAGGTATCA CCATTGATAT CGCTTTATGG 50 

AAATTCGAAA CTCCAAAATA CCACGTTACC GTTATTGATG CTCCAGGTCA 100 

CAGAGATTTC ATCAAGAACA TGATTACTGG TACTTCTCAA GCTGATTGTG 150 

CTATTTTRAT TATTGCTGGT GGTGTCGGTG AATTCGAAGC CGGTATCTCT 200 

AAGGATGGTC AAACCAGAGA ACACGCTTTA TTGGCTTACA CCTTAGGTGT 250 

TAGACAATTG ATTGTTGCCG TCAACAAGAT GGACTCTGTT AAATGGGACA 300 

AGGCTAGATT CGAAGAAATC ATCAAGGAAA CCTCTAACTT CGTCAAGAAG 350

GTTGGTTACA ACCCTAAGAC TGTTCCTTTC GTYCCAATTT CTGGATGGAA 400

CGGTGACAAC ATGATTGAAG CCTCCACCAA CTGTCCATGG TACAAGGGTT 450

GGGAAAAGGA AACCAAGGCT GGTAAATCTA CTGGTAAGAC TTTGTTAGAA 500

GCCATTGATG CCATTGAACC ACCAACCAGA CCAACCGAAA AGCCATTGAG 550 

ATTACCATTA CAAGATGTCT ACAAGATCGG TGGTATTGGT ACTGTGCCAG 600 

TCGGTAGAGT TGAAACCGGT GTTATCAAGG GTGGTATGGT TGTTACCTTT 650 

GCCCCAGCCG GTGTCACTAC CGAAGTCAAA TCCGTTGAAA TGCACCACGA 700 

ACAATTAGCT GAAGGTGTTC CAGGTGACAA TGTTGGTTTC AACGTCAAGA 750 

ACGTTTCCGT TAAGGAAATC AGAAGAGGTA ACGTTTGTGG TGACTCCAAG 800 

AACGACCCAC CAAAGGGTGC TGAATCTTTC ACCGCTCAAG TTATTGTCTT 850 

GAACCACCCA GGTCARATCT CTGCTGGTTA CTCTCCAGTC TTAGATTGTC 900 

ACACCGCCCA CATTGCTTGT AAATTCGATG CTTTACTCGA AAAGATTGAC 950 

AGAAGATCCG GTAAGAAATT AGAAGACGAA CCAAAATTCG TCAAGTCCGG 1000 

TGATGCTGCT ATCGTCAAGA TGGTCCCAAC CAAACCAATG TGTGTTGAAG 1050 

CTTTCACTGA ATACCCACCA TTAGGTAGAT TCGCTGTTAG AGAT 1094 



2) INFORMATION FOR SEQ ID NO: 417 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glabrata 

(B) STRAIN: ATCC 66032 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417

TTTGGACAAG TTGAAGGCTG AAAGAGAAAG AGGTATCACT ATCGATATCG 50 

CTTTGTGGAA GTTCGAAACT CCAAAGTACC ACGTYACCGT TATCGATGCY 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATTACTGGTA CTTCTCAAGC 150 

TGACTGTGCT ATCTTGATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCYG 200

GTATCTCCAA GGATGGTCAA ACCAGAGAAC ACGCTCTATT GGCTTTCACC 250 

CTAGGTGTTA GACAATTGAT TGTYGCTGTC AACAAGATGG ACTCTGTCAA 300 

GTGGGATGAA TCCAGATTCG CTGAAATCGT TAAGGAAACC TCCAACTTCA 350

TCAAGAAGGT CGGTTACAAC CCAAAGACTG TTCCATTCGT CCCAATCTCT 400

GGTTGGAACG GTGACAACAT GATTGAAGCC ACCACCAACG CTTCCTGGTA 450

CAAGGGTTGG GAAAAGGAAA CCAAGGCTGG TGTCGTCAAG GGTAAGACCT 500 

TGTTGGAAGC CATTGACGCT ATCGAACCAC CAACCAGACC AACTGACAAG 550 

CCATTGAGAT TGCCATTGCA AGATGTCTAC AAGATCGGTG GTATCGGTAC 600 

GGTGCCAGTC GGTAGAGTCG AAACCGGTGT CATCAAGCCA GGTATGGTTG 650 

TTACCTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700

CACCACGAAC AATTGACTGA AGGTTTGCCA GGTGACAACG TTGGTTTCAA 750 

CGTTAAGAAC GTTTCCGTTA AGGAAATCAG AAGAGGTAAT GTCTGTGGTG 800 

ACTCCAAGAA CGACCCACCA AAGGCTGCTG CTTCTTTCAA CGCTACCGTC 850 

ATTGTCTTGA ACCACCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTTTT 900 

GGACTGTCAC ACCGCCCACA TTGCTTGTAA GTTCGAAGAA TTGTTGGAAA 950

AGAACGACAG AAGATCCGGT AAGAAGTTGG AAGACTCTCC AAAGTTCTTG 1000 

AAGTCCGGTG ACGCTGCTTT GGTTAAGTTC GTTCCATCCA AGCCAATGTG 1050 

TGTCGAAGCT TTCTCCGACT ACCCACCATT GGGTAGATTC GCTGTCAGAG 1100 

A 1101 




2) INFORMATION FOR SEQ ID NO: 418 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1080 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 
(B) STRAIN: ATCC 6260

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418 

AGAGAAAGAG GTATCACCAT TGACATTGCT TTGTGGAAAT TCGAGACTCC 50 

AAAGTACCAC GTTACYGTCA TTGATGCCCC AGGTCACAGA GATTTCATCA 100

AGAACATGAT CACTGGTACT TCTCAAGCTG ACTGTGCTAT TTTGATTATT 150 

GCTGGTGGTA CCGGTGAATT CGAAGCTGGT ATCTCTAAGG ATGGTCAAAC 200 

CAGAGAGCAC GCTTTGTTGG CTTACACCTT GGGTGTTAGA CAATTGATTG 250 

TTGCTGTCAA CAAGATGGAC TCCGTCAART GGGACAAGAA CAGATTYGAG 300 

GAAATCATCA AGGAAACCTC TAACTTCGTC AAGAAGGTTG GTTACAACCC 350

TAAGACTGTG CCATTCGTTC CTATCTCTGG ATGGAAYGGT GACAACATGA 400 

TTGAGGCTTC TACCAACTGT CCTTGGTACA AGGGATGGGA GAAGGAGACC 450 

AAGGCTGGTA AGTCCACCGG TAAGACTTTG TTGGAGGCCA TTGACGCCAT 500 

TGAGCCACCT CAAAGACCAA CCGACAAGCC ATTGAGATTG CCATTGCAAG 550 

ATGTYTACAA GATTGGTGGT ATTGGAACGG TGCCAGTCGG TAGAGTTGAA 600

ACCGGTATCA TYAAGGCCGG TATGGTTGTT ACCTTTGCCC CAGCTGGTGT 650

YACCACTGAA GTCAAGTCCG TGGAAATGCA CCACGAACAA TTGGTTGAAG 700

GTGTTCCAGG TGACAATGTT GGTTTCAACG TTAAGAACGT TTCCGTTAAG 750

GAAATTAGAA GAGGTAACGT TTGTGGTGAC TCCAAGAACG ACCCACCAAA 800

GGGTTGTGAC TCTTTCACCG CTCAAGTTAT TGTGTTGAAC CACCCTGGTC 850

AAATCTCTGC TGGTTACTCT CCAGTTTTGG ACTGTCACAC CGCCCACATT 900

GCTTGTAAAT TCGACACCTT GTTGGAGAAG ATTGACAGAA GAACCGGTAA 950

GAAGATGGAG GACAACCCCA AGTTTGTCAA GTCCGGTGAC GCTTCTATCG 1000

TCAAGATGGT GCCATCCAAG CCAATGTGTG TTGAGGCTTT CACCGACTAC 1050

CCACCATTGG GAAGATTCGC CGTCAGAGAC 1080



2) INFORMATION FOR SEQ ID NO: 419 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida haemulonii 

(B) STRAIN: ATCC 22991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419 



TCTGTCAAGT GGGACAAGGC CAGATACGAG GAAATCGTCA AGGAGACCTC 50

TAACTTCGTC AAGAAGGTTG GTTACAACCC TAAGACTGTT CCATTCGTCC 100

CAATCTCTGG TTGGAACGGT GACAACATGA TTGAGGCTTC TACCAACTGT 150

GACTGGTACA AGGGTTGGGA GAAGGAGACC AAGTCTGGTA AGTCCACCGG 200

TAAGACCTTG TTGGAGGCCA TTGACGCCAT TGAGCCACCA ACCAGACCAA 250

CCGACAAGCC ATTGAGATTG CCATTGCAGG ATGTCTACAA GATTGGTGGT 300

ATCGGAACTG TGCCAGTCGG CAGAGTTGAG ACCGGTGTTA TCAAGGCCGG 350

TATGGTTGTC ACCTTCGCCC CAGCTGGTGT CACCACTGAA GTCAAGTCTG 400

TCGAGATGCA CCACGAGCAG TTGCCAGAGG GTGTCCCAGG TGACAACGTT 450

GGTTTCAACG TCAAGAACGT TTCCGTTAAG GAAATCAGAA GAGGTAACGT 500

CTGTGGTGAC TCCAAGCAGG ACCCACCAAA GGGCTGTGAC TCTTTCACCG 550

CTCAGGTTAT TGTGTTGAAC CACCCAGGTC AGATCTCTTC TGGTTACTCT 600

CCAGTTTTGG ACTGTCACAC TGCCCACATT GCTTGTAAGT TCGACACCTT 650

GGTTGAGAAG ATCGACAGAA GAACCGGTAA GAAGTTGGAA GATGAGCCAA 700

AGTTCATCAA GTCCGGTGAC GCTGCTATCG TCAAGATGGT CCCAACCAAG 750

C 751



2) INFORMATION FOR SEQ ID NO: 420 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida inconspicua 

(B) STRAIN: ATCC 16783 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420 

TCTTGACAAG TTAAAGGCTG AAAGAGAAAG AGGTATCACT ATTGATATTG 50

CTTTATGGAA ATTCGAAACT CCAAAGTATC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT TAAGAACATG ATTACTGGTA CTTCTCAAGC 150 

AGATTGTGCT ATTTTGATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 200 

GTATTTCCAA GGATGGTCAA ACTAGAGAAC ACGCTTTATT AGCATTCACC 250 

TTAGGTGTTA AGCAATTGAT TGTTGCTATC AACAAGATGG ATTCTGTTAA 300

GTGGGATGAA AAGAGATTTG AAGAAATTGT CAAGGAAACC CAAAACTTCA 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATTTCT 400 

GGTTGGAATG GTGACAACAT GATTGAACCA TCTTCTAACT GTCCATGGTA 450 

CAAGGGTTGG ACTAAGGAAA CCAAGGCAGG TGTTGTCAAG GGTAAGACCT 500 

TATTAGAAGC TATTGATGCT ATTGAACCAC CTGTCAGACC AACTGATAAG 550

CCATTAAGAT TACCATTACA AGATGTTTAC AAGATTGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACCGGTAT TATTAAGCCA GGTATGGTTG 650 

TTGTTTTCGC ACCATCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700 

CACCATGAAC AATTAGAAGA AGGTGTCCCA GGTGACAATG TTGGTTTCAA 750 

CGTCAAGAAC GTCTCTGTTA AGGATATCAA GAGAGGTAAC GTTTGTGGTG 800

ACTCCAAGAA TGACCCACCA CAAGGTTGTG CTTCCTTCAA TGCTCAAGTC 850 

ATTGTCTTGA ACCACCCTGG TCAAATTTCT GCTGGTTACT CTCCAGTTTT 900 

AGATTGTCAC ACTGCCCACA TTGCATGTAA ATTCGATGAA TTAATTGAAA 950 

AGATTGACAG AAGAACTGGT AAGTCCGTTG AAGACCATCC AAAGTCTGTT 1000 

AAGTCTGGTG ATGCAGCTAT CGTTAAGATG GTTCCAACCA AGCCAATGTG 1050

TGTTGAAGCT TTCACTGAAT ACCCACCATT AGGTAGATTC GCAGTCAGAG 1100 

AT 1102 






2) INFORMATION FOR SEQ ID NO: 421 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1099 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida kefyr 

(B) STRAIN: ATCC 28838 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421

TGGACAAGTT AAAGGCTGAA AGAGAAAGAG GTATCACCAT CGATATCGCT 50 

TTGTGGAAGT TCGAAACTCC AAAGTACCAA GTTACCGTTA TCGATGCTCC 100 

AGGTCACAGA GATTTCATCA AGAACATGAT TACTGGTACT TCTCAAGCTG 150 

ACTGTGCTAT CTTGATTATT GCTGGTGGTG TCGGTGAATT CGAAGCCGGT 200

ATCTCCAAGG ATGGTCAAAC CAGAGAACAC GCTTTGTTGG CTTACACCTT 250 

GGGTGTTAGA CAATTGATTG TTGCTATCAA CAAGATGGAC TCTGTTAAGT 300 

GGGATGAATC TCGTTACCAA GAAATTGTTA AGGAAACCTC CAACTTCATC 350 

AAGAAGGTCG GTTACAACCC AAAGAATGTT CCATTCGTCC CAATCTCTGG 400

TTGGAACGGT GACAACATGA TTGAAGCCAC CACCAACGCT CCATGGTACA 450

AGGGTTGGGA AAAGGAAACC AAGGCTGGTA CCGTCAAGGG TAAGACCTTG 500 

TTGGAAGCTA TTGACGCTAT CGAACCACCA ACCAGACCAA CTGACAAGCC 550 

ATTGAGATTG CCATTGCAAG ATGTCTACAA GATCGGTGGT ATTGGTACTG 600 

TGCCAGTCGG TAGAGTCGAA ACCGGTGTCA TCAAGCCAGG TATGGTTGTT 650 

ACCTTCGCCC CAGCCGGTGT CACTACCGAA GTTAAGTCCG TCGAAATGCA 700

CCACGAACAA TTGGAAGAAG GTCTACCAGG TGACAACGTC GGTTTCAACG 750

TCAAGAACGT TTCCGTTAAG GAAATCAGAA GAGGTAACGT CTGTGGTGAC 800

TCCAAGAACG ATCCACCAAA GGCTGCTGCT TCTTTCAACG CCACTGTTAT 850

CGTCTTGAAC CACCCAGGTC AAATCTCTGC TGGTTACTCT CCAGTTTTGG 900

ATTGTCACAC TGCTCACATT GCTTGTAAGT TCGACGAATT GTTGGAAAAG 950

AACGACAGAA GATCCGGTAA GAAGTTGGAA GACTCTCCAA AGTTCTTGAA 1000

GTCTGGTGAC GCTGCTTTGG TTAAGTTCGT TCCATCTAAG CCAATGTGTG 1050

TTGAAGCATT CTCTGACTAC CCACCATTGG GTAGATTCGC TGTCAGAGA 1099



2) INFORMATION FOR SEQ ID NO: 422 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida krusei 

(B) STRAIN: ATCC 34135 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422 

AAGTTAAAGG CAGAAAGAGA AAGAGGTATC ACTATTGATA TTGCTTTATG 50 

GAAGTTYGAA ACTCCAAART ACCACGTTAC CGTTATTGAT GCTCCAGGTC 100 

ACAGAGATTT CATCAAGAAC ATGATTACCG GTACTTCTCA AGCTGATTGT 150 

GCTATTTTGA TTATTGCTGG TGGTGTCGGT GAATTCGAAG CTGGTATCTC 200 

CAAGGATGGT CAAACTAGAG AACACGCTCT ATTGGCTTTC ACCTTAGGTG 250 

TTAGACAATT GATTGTTGCT ATCAACAAGA TGGATTCCGT TAARTGGGAT 300 

GAAAACAGAT TTGAAGAAAT TGTCAAGGAA ACCCAAAACT TCATCAAGAA 350

GGTTGGTTAC AACCCAAAGA CTGTTCCATT CGTTCCAATY TCTGGTTGGA 400

ATGGTGACAA CATGATTGAA GCATCCACCA ACTGTCCATG GTACAAGGGT 450

TGGACTAAGG AAACCAAGGC AGGTGTTGTT AAGGGTAAGA CCTTATTAGA 500

AGCAATCGAT GCTATTGAAC CACCTGTCAG ACCAACCGAA AAGCCATTAA 550

GATTACCATT ACAAGATGTT TACAAGATTG GTGGTATTGG TACTGTGCCA 600

GTCGGTAGAG TCGAAACCGG TGTCATTAAG CCAGGTATGG TTGTCACTTT 650

TGCTCCAGCA GGTGTCACCA CCGAAGTCAA GTCCGTTGAA ATGCACCATG 700

AACAATTAGA ACAAGGTGTT CCAGGTGATA ACGTTGGTTT CAACGTTAAG 750

AACGTTTCTG TCAAGGATAT CAAGAGAGGT AACGTTTGTG GTGACTCCAA 800

GAACGACCCA CCAATGGGTG CAGCTTCCTT CAATGCTCAA GTCATTGTCT 850

TGAACCACCC TGGTCAAATT TCCGCTGGTT ACTCTCCAGT CTTGGATTGT 900

CACACTGCCC ACATTGCATG TAAGTTCGAC GAATTAATCG AAAAGATTGA 950 

CAGAAGAACT GGTAAGTCTG TTGAAGACCA TCCAAAGTCY GTCAAGTCTG 1000 

GTGATGCAGC TATCGTCAAG ATGGTCCCAA CCAAGCCAAT GTGTGTTGAA 1050 

GCTTTCACTG AATAYCCACC ATTAGGTAGA TTCGCAGTCA GAGAT 1095 



2) INFORMATION FOR SEQ ID NO: 423 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1104 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 






(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lambica

(B) STRAIN: ATCC 24750

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423 

CTTGGACAAG CTTAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTATGGAA GTTCGAAACT CCAAAGTACC ACGTTACCGT CATTGACGCT 100

CCAGGTCACA GAGATTTCAT CAAGAACATG ATTACTGGTA CCTCTCAAGC 150 

AGATTGTGCT ATTTTRATYA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 200 

GTATCTCTAA GGATGGTCAA ACCAGAGAAC ACGCTCTTCT TGCATTCACT 250 

CTTGGTGTTA GACAATTGAT TGTTGCTATC AACAAGATGG ACTCTGTCAA 300

GTGGGACGAA TCCAGATTCG ATGAAATTTG TAAGGAAACC GCWAACTTCA 350

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT CCCAATCTCT 400

GGTTGGAACG GTGACAACAT GATTGAACCA TCTGCTAACT GTCCATGGTA 450 

CAAGGGATGG ACTAAGGAAA CCAAGGCTTC CGGTGTCGTC AAGGGTAAGA 500 

CCCTTCTTGA AGCAATTGAT GCTATTGAGC CACCTGTCAG ACCAACTGAC 550 

AAGGCTTTGA GATTGCCATT RCAAGATGTC TACAAGATTG GTGGTATTGG 600

TACTGTGCCA GTCGGTAGAG TTGAAACCGG TATCATCAAG CCAGGTATGA 650 

TTGTCGTTTT CGCTCCAACC GGTGTTACTA CTGAAGTTAA GTCCGTTGAA 700 

ATGCACCATG AACAATTAGA AGAAGGTGTC CCAGGTGACA ATGTTGGTTT 750

CAACGTCAAG AACGTCTCTG TTAAGGATAT TAAGAGAGGT AACGTCTGTG 800

GTGACTCCAA GAACGACCCA CCAATGGGTT GTGCTTCCTT CAATGCTCAA 850

GTCATTGTTC TTAACCACCC AGGTCAAATT TCTGCTGGTT ACTCACCAGT 900 

TCTTGACTGT CACACTGCCC ACATTGCATG TAAGTTCGAY GAATTACTCG 950 

AAAAGATTGA CAGAAGAACC GGTAAGGCTA CTGAAGACCA TCCAAAGTCT 1000

GTCAAGTCTG GTGATGCAGC TATCGTCAAG ATGGTTCCAA CCAAGCCAAT 1050

GTGTGTYGAA GCTTTCACTG ACTACCCACC ATTAGGTAGA TTCGCTGTYA 1100

GAGA 1104 



2) INFORMATION FOR SEQ ID NO: 424



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE:

(A) ORGANISM: Candida lusitaniae 

(B) STRAIN: ATCC 66035 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424 

GGACAAGTTG AAGGCTGAGA GAGAAAGAGG TATCACCATC GATATCGCTT 50

TGTGGAAGTT CGAGACTCCA AAGTACCACG TTACCGTCAT TGACGCTCCA 100

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACTT CCCAAGCTGA 150

CTGTGCTATC TTGATTATCG CTGGTGGTGT CGGTGAGTTC GAAGCCGGTA 200

TCTCTAAGGA CGGTCAAACC AGAGAGCACG CTTTGTTGGC TTACACCTTG 250

GGTGTCAAGC AGTTGATTGT TGCTGTCAAC AAGATGGACT CCGTCAAGTG 300

GGACCAATCT AGATTCGAGG AAATCATCAA GGAAACCTCT AACTTCGTCA 350

AGAAGGTTGG TTACAACCCT AAGACTGTTC CATTCGTCCC AATCTCTGGT 400

TGGAACGGTG ACAACATGAT TGAGCCATCY ACCAACTGCC CATGGTACAA 450

GGGTTGGGAG AAGGAGACCA AGTCYGGTAA GTCCACCGGT AAGACCTTGT 500

TGGAGGCCAT TGACGCCATT GAGCCACCTT CGAGACCAAC CGACAAGC[illegible] 550

TTGAGATTGC CATTGCAAGA TGTCTACAAG ATYGGTGGTA TTGGTACTGT 600

GCCAGTCGGT AGAGTTGAGA CCGGTGTCAT CAAGGCCGGT ATGGTTGTCA 650

CCTTTGCTCC AGCTGGTGTC ACCACTGAAG TCAAGTCCGT GGAAATGCAC 700

CACGAACAAT TGGCTGAGGG TGTCCCAGGT GACAACGTTG GTTTCAACGT 750

CAAGAACGTT TCCGTCAAGG AAATCAGAAG AGGTAACGTC TGTGGTGACT 800

CCAAGAACGA CCCACCAAAG GCTGCTGCTT CYTTCACTGC TCAAGTYATY 850

GTCTTGAACC ACCCAGGTCA AATCTCCTCY GGTTACTCTC CAGTYTTGGA 900

CTGTCACACT GCYCACATTG CTTGTAAGTT CGACACCTTG ATTGAGAAGA 950

TCGACAGAAG AACYGGTAAG AAGTTGGAAG AAGAGCCAAA GTTCATCAAG 1000

TCYGGTGACG CTGCTATCGT CAAGATGGTC CCAACCAAGC CAATGTGYGT 1050

YGAAGCTTTC ACCGACTACC CACCATTGGG TAGATTCGCT GTCAGAGA 1098



2) INFORMATION FOR SEQ ID NO: 425 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegensis 

(B) STRAIN: ATCC 22977 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425

TCTTGACAAG TTAAAGGCTG AAAGAGAAAG AGGTATCACT ATTGATATTG 50

CTTTATGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT TATTGATGCT 100

CCAGGTCACA GAGATTTCAT TAAGAACATG ATTACTGGTA CTTCCCAAGC 150

TGATTGTGCT ATCTTAATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 200

GTATCTCCAA GGATGGTCAA ACCAGAGAAC ACGCTTTATT AGCATTCACC 250

TTAGGTGTTA AGCAATTAAT TGTTGCTATC AACAAGATGG ACTCTGTTAA 300

GTGGGATGAA AAGAGATTTG AAGAAATTGT CAAGGAAACC CAAAACTTCA 350

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATTTCT 400

GGTTGGAATG GTGACAACAT GATTGAACCA TCTACTAACT GTCCATGGTA 450

CAAGGGTTGG ACTAAGGAAA CCAAGGCAGG TGTTGTTAAG GGTAAGACCT 500

TATTAGAAGC TATTGATGCT ATTGAACCAC CTGTCAGACC AACTGACAAG 550

CCATTAAGAT TACCATTACA AGATGTTTAC AAGATTGGTG GTATTGGTAC 600

TGTGCCAGTC GGTAGAGTTG AAACCGGTGT TATTAAGCCA GGTATGGTTG 650

TTGTTTTCGC ACCATCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700

CACCATGAAC AATTAGAACA AGGTGTCCCA GGTGACAATG TTGGTTTCAA 750

CGTCAAGAAC GTCTCTGTTA AGGATATTAA GAGAGGTAAC GTTTGTGGTG 800

ACTCCAAGAA CGACCCACCA CAAGGTTGTG CTTCCTTCAA TGCTCAAGTC 850

ATTGTCTTGA ACCACCCTGG TCAAATTTCT GCAGGTTACT CTCCAGTTTT 900

AGATTGTCAC ACTGCCCACA TTGCATGTAA GTTCGATGAA TTAATCGAAA 950

AGATTGACAG AAGAACTGGT AAGTCCGTTG AAGACCATCC AAAGTCTGTT 1000

AAGTCTGGTG ATGCAGCTAT CGTTAAGATG GTTCCAACCA AGCCAATGTG 1050

TGTTGAAGCT TTCACTGAAT ACCCACCATT AGGTAGATTC GCAGTCAGAG 1100

A 1101



2) INFORMATION FOR SEQ ID NO: 426 
(i) SEQUENCE CHARACTERISTICS:



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(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 
(B) STRAIN: ATCC 90018

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426 

CAAATTGAAG GCTGAAAGAG AAAGAGGTAT CACCATTGAT ATCGCTTTGT 50 

GGAAATTCGA AACTCCAAAA TACCATGTTA CTGTTATTGA TGCTCCAGGT 100

CACAGAGATT TCATCAAGAA TATGATTACT GGTACTTCTC AAGCTGATTG 150 

TGCTATTTTG ATTATTGCTG GTGGTACTGG TGAATTCGAA GCTGGTATCT 200 

CTAAGGATGG TCAAACCAGA GAACACGCTT TGTTGGCTTA CACCTTGGGT 250 

GTTAAGCAAT TGATTGTTGC CATCAACAAG ATGGACTCAG TCAAATGGGA 300 

CAAGAACAGA TACGAAGAAA TTGTCAAGGA AACTTCCAAC TTCGTCAAGA 350

AGGTTGGTTA CAACCCTAAA GCTGTCCCAT TCGTCCCAAT CTCTGGTTGG 400 

AACGGTGACA ATATGATTGA ACCATCAACC AACTGTCCAT GGTACAAGGG 450 

TTGGGAAAAG GAAACTAAAG CTGGTAAGGT TACCGGTAAG ACCTTGTTGG 500 

AAGCTATCGA TGCTATCGAA CCACCAACCA GACCAACTGA CAAGCCATTG 550 

AGATTGCCAT TGCAAGATGT CTACAAGATT GGTGGTATTG GAACTGTGCC 600

AGTTGGTAGA GTTGAAACCG GTATCATCAA GGCTGGTATG GTTGTTACTT 650 

TTGCCCCAGC TGGTGTTACC ACTGAAGTCA AGTCCGTTGA AATGCACCAC 700 

GAACAATTGA CTGAAGGTGT CCCAGGTGAC AATGTTGGTT TCAACGTCAA 750 

GAACGTTTCA GTTAAGGAAA TCAGAAGAGG TAACGTTTGT GGTGACTCCA 800 

AGAACGATCC ACCAAAGGGA TGTGAYTCCT TCAATGCTCA AGTTATTGTC 850

TTGAACCACC CAGGTCAAAT CTCTGCTGGT TACTCACCAG TCTTGGATTG 900 

TCACACTGCC CACATTGCTT GTAAATTCGA CACTTTGATT GAAAAGATTG 950 

ACAGAAGAAC CGGTAAGAAA TTGGAAGATG AACCAAAATT CATCAAGTCC 1000 

GGTGATGCTG CYATCGTCAA GATGGTCCCA ACCAAGCCAA TGTGTGTTGA 1050 

AGCTTTCACT GACTACCCAC CATTGGGAAG ATTCGCTGTT AGAGA 1095



2) INFORMATION FOR SEQ ID NO: 427 






(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 752 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Candida rugosa

(B) STRAIN: ATCC 96275 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427 

CTCCGTCAAG TGGTCTCAGT CTCGTTTCGA GGAGATCGTC AAGGAGGTTT 50

CCAACTTCAT CAAGAAGGTT GGTTACAAGC CCGATGAGGT TCCTTTCGTC 100 

CCCATCTCTG GCTGGAACGG CGACAACATG CTTGAGCCCT CCACCAACTG 150 

CCCCTGGTAC AAGGGATGGA CCAAGAAGAC CAAGAAGGGT GAGGTCAAGG 200 

GTAAGACTCT TCTCGAGGCC ATTGACGCCA TCGAGCCCCC CTCCCGTCCT 250 

ACCGACAAGC CCCTCCGCTT GCCTCTTCAG GATGTCTACA AGATCGGCGG 300

TATCGGTACG GTACCTGTCG GCCGTGTCGA GACCGGTATC ATCAAGCCCG 350

GCATGGTCGT CACTTTCGCC CCCGCTGGTG TCACCACTGA AGTGAAGTCC 400

GTCGAGATGC ACCACGAGCA GATCCCCGAG GGTCTCCCCG GTGACAACGT 450

CGGTTTCAAC GTCAAGAACG TTACCGTCAA GGATATCCGC CGTGGTAACG 500

TCTGCGGTGA CTCCAAGAAC GACCCCCCCA AGGGCTGCTC TTCCTTCACT 550

GCCCAGGTCA TCGTTTTCAA CCACCCCGGT CAGATCTCCA ACGGTTACTC 600

CCCCGTTTTG GACTGCCACA CCGCCCACAT TGCCTGCCGC TTCGACGAGA 650

TCCAGTCCAA GATGGACCGT CGTACTGGTA AGACCCTTGA GGAGAACCCC 700

AAGTTCATCA AGGCTGGTGA CTCCGCTATC GTCAAGATGG TTCCCTCCAA 750

GC 752



2) INFORMATION FOR SEQ ID NO: 428 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1093 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Candida sphaerica

(B) STRAIN: ATCC 2504 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428 

AGTTAAAGGC TGAAAGAGAA AGAGGTATCA CCATCGATAT CGCTTTGTGG 50

AAGTTCGAAA CTCCAAAGTA CCAAGTTACC GTTATCGATG CTCCAGGTCA 100 

CAGAGATTTC ATCAAGAACA TGATTACTGG TACTTCTCAA GCTGACTGTG 150 

CTATCTTGAT TATTGCTGGT GGTGTCGGTG AATTCGAAGC CGGTATCTCC 200 

AAGGATGGTC AAACCAGAGA ACACGCTTTG TTGGCTTTCA CCTTGGGTGT 250 

TAGACAATTG ATTGTTGCTG TTAACAAGAT GGATTCCGTT AAGTGGGATG 300

AATCTCGTTT CCAAGAAATT GTCAAGGAAA CCTCTAACTT CATCAAGAAG 350 

GTCGGTTACA ACCCAAAGAC TGTTCCATTC GTCCCAATCT CTGGTTGGAA 400

CGGTGACAAC ATGATTGAAG CCACCACCAA TGCTTCATGG TACAAGGGTT 450 

GGGAAAAGGA AACCAAGTCC GGTGTCGTCA AGGGTAAGAC CTTGTTGGAA 500

GCTATTGACG CTATCGAACC ACCATCCAGA CCAACTGACA AGCCATTGAG 550

ATTGCCATTG CAAGATGTCT ACAAGATTGG TGGTATCGGA ACTGTGCCAG 600 

TCGGTAGAGT CGAAACCGGT GTTATCAAGC CAGGTATGAT TGTTACCTTT 650 

GCCCCAGCCG GTGTTACTAC TGAAGTTAAG TCCGTCGAAA TGCACCACGA 700 

ACAATTGGAA GAAGGTCTAC CAGGTGACAA CGTCGGTTTC AACGTCAAGA 750 

ACGTTTCCGT TAAGGAAATC AGAAGAGGTA ACGTCTGTGG TGACTCCAAG 800

AACGATCCAC CAAAGGCTGC TGCTTCTTTC AACGCCACTG TTATCGTCTT 850 

GAACCATCCA GGTCAAATCT CTGCTGGTTA CTCTCCAGTT TTGGATTGTC 900 

ACACTGCTCA CATTGCTTGT AAGTTCGACG AATTGTTGGA AAAGAACGAT 950 

AGAAGATCCG GTAAGAAGTT GGAAGACTCT CCAAAGTTCT TGAAGTCCGG 1000 

TGATGCTGCT TTGGTTAAGT TCGTTCCATC TAAGCCAATG TGTGTTGAAG 1050

CCTTCTCTGA CTACCCACCT CTAGGTAGAT TCGCTGTCAG AGA 1093 



2) INFORMATION FOR SEQ ID NO: 429

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 

(B) STRAIN: ATCC 13803 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429 



AAATTGAAGG CTGAAAGAGA AAGAGGTATC ACCATTGATA TCGCTTTGTG 50

GAAATTCGAA ACTCCAAAAT ACCACGTTAC CGTTATTGAT GCTCCAGGTC 100

ACAGAGATTT CATCAAGAAC ATGATTACTG GTACTTCCCA AGCTGATTGT 150

GCTATTTTGA TTATTGCTGG TGGTACTGGT GAATTCGAAG CTGGTATTTC 200

TAAAGATGGT CAAACCAGAG AACACGCTTT GTTGGCTTAC ACCTTGGGTG 250

TCAAACAATT GATTGTTGCT GTCAACAAGA TGGACTCTGT TAAATGGGAC 300

AAAAACAGAT TTGAAGAAAT TATCAAGGAA ACTTCTAACT TCGTCAAGAA 350

GGTTGGTTAC AACCCTAAGG CTGTTCCATT CGTTCCAATC TCTGGTTGGA 400

ATGGTGACAA CATGATTGAA GCTTCTACCA ACTGTCCATG GTACAAGGGT 450

TGGGAAAAAG AAACCAAGGC TGGTAAGGTT ACCGGTAAGA CTTTGTTGGA 500

AGCCATTGAT GCTATTGAAC CACCTTCAAG ACCAACTGAC AAGCCATTGA 550

GATTGCCATT GCAAGATGTT TACAAGATTG GTGGTATTGG TACTGTGCCA 600

GTCGGTAGAG TTGAAACTGG TGTCATCAAA GCCGGTATGG TTGTTACTTT 650

CGCCCCAGCT GGTGTTACCA CTGAAGTCAA ATCCGTCGAA ATGCACCACG 700

AACAATTGGC TGAAGGTGTC CCAGGTGACA ATGTTGGTTT CAACGTTAAG 750

AACGTTTCTG TTAAAGAAAT TAGAAGAGGT AACGTTTGTG GTGACTCCAA 800

GAACGATCCA CCAAAGGGTT GTGACTCTTT CAACGCTCAA GTTATTGTCT 850

TGAACCACCC AGGTCAAATC TCTGCTGGTT ACTCTCCAGT CTTGGATTGT 900

CACACTGCTC ATATTGCTTG TAAATTCGAC ACCTTGGTTG AAAAGATTGA 950

CAGAAGAACT GGTAAGAAAT TGGAAGAAAA TCCAAAATTC GTCAAATCCG 1000

GTGATGCTGC TATTGTCAAG ATGGTTCCAA CCAAACCAAT GTGTGTTGAA 1050

GCTTTCACTG ACTACCCACC ATTAGGTAGA TTCGCTGTCA GAGA 1094



2) INFORMATION FOR SEQ ID NO: 430 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1095 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utilis 

(B) STRAIN: Csp 388 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430



CAAGCTTAAA GCTGAGAGAG AGAGAGGTAT CACTATCGAC ATTGCTCTCT 50

GGAAGTTCGA GACTCCAAAG TACCACGTTA CTGTCATTGA TGCCCCAGGT 100

CACAGAGATT TCATCAAGAA CATGATTACT GGTACCTCCC AGGCTGACTG 150

TGCTATTCTT ATCATTGCCG GTGGTGTTGG TGAGTTCGAG GCTGGTATCT 200

CTAAGGATGG TCAGACCAGA GAGCACGCTT TGCTCGCTTT CACCCTTGGT 250

GTTAGACAGA TGATTGTTGC TATCAACAAG ATGGACTCTG TCAAGTGGGA 300

CGAGAAGAGA TTCGAGGAGA TCGTTAAGGA GACCTCTAAC TTCATCAAGA 350

AGGTTGGTTA CAACCCAAAG ACTGTTCCAT TTGTCCCAAT TTCYGGTTGG 400

AACGGTGACA ACATGATTGA GGCCTCTACC AACTGTCCAT GGTACAAGGG 450

TTGGGAGAAG GAGACCAAGG CTGGTGTTGT CAAGGGTAAG ACCTTGCTCG 500

ATGCCATTGA CGCCATTGAG CCACCAACAA GACCAACTGA CAAGCCATTG 550

AGATTGCCAC TCCAGGATGT CTACAAGATT GGTGGTATCG GAACTGTTCC 600

AGTCGGTAGA GTCGAGACCG GTGTCATCAA GCCAGGTATG GTTGTTACCT 650

TTGCCCCATC CGGTGTCACC ACTGAGGTTA AGTCCGTCGA GATGCACCAC 700

GAGCAGCTTG CTGAGGGTAT CCCAGGTGAC AACGTTGGTT TCAACGTTAA 750

GAACGTCTCT GTTAAGGAGA TCAGAAGAGG TAACGTTGCC GGTGACTCCA 800

AGAACGACCC ACCACAGGGT GCTGAGTCCT TCAACGCTCA GGTCATTGTC 850

TTGAACCACC CAGGTCAGAT CTCTGCTGGT TACTCTCCAG TTTTGGACTG 900

TCACACCGCC CACATTGCTT GTAAGTTCTC TGAGCTTTTG GAGAAGATTG 950

ACAGAAGATC CGGTAAGTCC CTTGAGGCCT CTCCAAAGTT CGTCAAGTCT 1000

GGTGATGCCG CTATCGTCAA GATGGTTCCA TCCAAGCCAT TGTGTGTTGA 1050

GGCCTTCACT GACTACCCAC CACTCGGTAG ATTCGCTGTC AGAGA 1095



2) INFORMATION FOR SEQ ID NO: 431 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1085 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 28269 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431 

GCTGAAAGAG AAAGAGGTAT CACCATCGAT ATCGCTTTGT GGAAATTCGA 50 

AACTCCAAAR TACCACGTTA CCGTCATTGA YGCTCCAGGT CACAGAGATT 100 

TCATCAAGAA CATGATYACT GGTACTTCTC AAGCTGATTG TGCTATYTTG 150 

ATTATCGCTG GTGGTACTGG TGAATTCGAA GCTGGTATYT CTAAGGATGG 200 

TCAAACCAGA GAACACGCTT TGTTGGCCTA CACCTTGGGT GTCAAGCAAT 250 

TGATTGTTGC TGTCAACAAG ATGGACTCTG TCAAATGGGA CAAGAACAGA 300 

TTCGAAGAAA TCATCAAGGA AACCTCCAAC TTCGTCAAGA AGGTTGGTTA 350 

CAACCCAAAG ACTGTTCCAT TCGTCCCAAT CTCTGGTTGG AACGGTGACA 400

ACATGATTGA AGCCTCCACC AACTGCCCAT GGTACAAGGG TTGGGAAAAG 450

GAAACCAAGG CTGGTAAGGT TACCGGTAAG ACTTTGTTGG AAGCCATTGA 500 

CGCTATCGAA CCACCAACCA GACCAACTGA CAAGCCATTG AGATTGCCAT 550 

TGCAAGATGT CTACAAGATT GGTGGTATCG GAACTGTGCC AGTCGGTAGA 600 

GTTGAAACTG GTGTCATCAA GGCCGGTATG GTTGTCACTT TYGCCCCAGC 650 

TGGTGTTACC ACTGAAGTCA AGTCCGTTGA AATGCACCAC GAACAATTGG 700 

CTGAAGGTGT CCCAGGTGAC AACGTTGGTT TCAACGTCAA GAACGTTTCC 750 

GTCAAGGAAA TCAGAAGAGG TAACGTCTGT GGTGACTCCA AGAACGACCC 800 

ACCAAAGGGT TGTGASTCTT TCAACGCTCA AGTCATTGTC TTGAACCACC 850 

CAGGTCAAAT CTCTGCTGGT TACTCTCCAG TCTTGGATTG TCACACTGCC 900 

CACATTGCTT GTAAGTTTGA CACCTTGGTT GAAAAGATTG ACAGAAGAAC 950 

CGGTAAGAAG TTGGAAGAAA ACCCAAAGTT TGTCAAGTCC GGTGACGCTG 1000 

CTATCGTCAA GATGGTCCCA ACCAAGCCAA TGTGTGTTGA AGCYTTCACT 1050 

GACTACCCAC CATTGGGTAG ATTCGCTGTC AGAGA 1085 



2) INFORMATION FOR SEQ ID NO: 432 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1072 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 
(B) STRAIN: ATCC 7351

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432 

AGGTATTACC ATTGACATTG CCTTGTGGAA GTTCGAGACC CCCAAGTACC 50 

AGGTCACCGT CATTGACGCT CCTGGCCACA GAGATTTCAT TAAGAACATG 100

ATCACTGGTA CCTCCCAGGC TGACTGTGCC ATCTTGATCA TTGCTGGTGG 150 

TGTTGGTGAG TTCGAGGCTG GTATCTCCAA GGATGGCCAG ACCAGAGAGC 200 

ACGCCTTGCT TGCCTACACC TTGGGTGTCA AGCAATTGAT TGTTGCTGTC 250 

AACAAGATGG ACTCCGTCAA GTGGGACAAG AACAGATTCG AGGAGATTGT 300 

CAAGGAGACC TCCAACTTCG TCAAGAAGGT TGGCTACAAC CCCAAGACTG 350

TCCCCTTCGT TCCCATCTCC GGTTGGAACG GTGACAACAT GATTGAGGCC 400 

TCCACCAACT GCCCTTGGTA CAAGGGTTGG GAGAAGGAGA CCAAGGCCGG 450

TAAGGTCACT GGTAAGACCT TGTTGGAGGC TATTGACGCC ATTGAGCCCC 500 

CCACCAGACC CACCGACAAG CCCTTGAGAT TGCCCTTGCA GGATGTCTAC 550

AAGATTGGTG GTATTGGAAC GGTGCCCGTT GGCAGAGTTG AGACCGGCAT 600

CATCAAGGCC GGTATGGTTG TCACCTTTGC CCCCGCTGGT GTCACTACTG 650 

AAGTGAAGTC TGTCGAGATG CACCACGAGC AATTGGCTGA GGGTGTCCCA 700 

GGTGACAATG TTGGTTTCAA CGTGAAGAAC GTTTCCGTTA AGGAGATCAG 750 

AAGAGGTAAC GTTTGCGGTG ACTCCAAGAA CGACCCCCCC AAGGCTGCTG 800 

CTTCTTTCAA CGCCCAGGTT ATCGTCTTAA ACCACCCCGG TCAAATCTCT 850

GCTGGTTACT CTCCGGTTTT GGATTGCCAC ACTGCCCACA TTGCTTGCAG 900 

ATTCGACCAG TTGATTGAGA AGATCGACAG AAGAACCGGT AAGAAGATGG 950 

AGGACGACCC TAAGTTCATC AAGTCCGGTG ACGCTGCCAT CGTCAAGATG 1000 

GTTCCTTCCA AGCCCATGTG TGTTGAGGCC TTCACTGACT ACCCTCCCTT 1050 

GGGTCGTTTC GCTGTCAGAG AC 1072



2) INFORMATION FOR SEQ ID NO: 433 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Coccidioides immitis

(B) STRAIN: Silveira 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433 

AGCACCAACT GGTCCGAGCC TCGTTTCAAC GAAATCGTCA AGGAAGTCTC 50

CAACTTCATC AAGAAGGTCG GATACAACCC CAAGGCTGTT CCATTCGTCC 100 

CCATCTCTGG TTTCGAAGGT GACAACATGA TTCAACCCTC CACCAACGCT 150 

CCTTGGTACA AGGGCTGGAA CAAGGAGACC GCCTCTGGCA AGCACACTGG 200 

CAAGACCCTC CTCGACGCCA TTGATGCCAT CGACCCCCCA ACCCGCCCCA 250 

CCGAGAAGCC CCTCCGTCTC CCACTTCAGG ATGTGTACAA GATCTCTGGT 300

ATCGGAACAG TCCCAGTCGG CCGTGTCGAA ACCGGTGTTA TCAAGCCTGG 350

TATGGTTGTG ACCTTCGCTC CTTCCAACGT CACCACTGAA GTCAAGTCCG 400

TCGAAATGCA CCACCAGCAG CTCACCCAGG GTAACCCTGG TGACAACGTT 450

GGCTTCAACG TCAAGAACGT CTCTGTCAAG GAAGTCCGCC GCGGTAACGT 500

CGCTGGTGAC TCCAAGAACG ACCCACCAAA GGGCTGCGAC TCCTTCAACG 550

CCCAGGTCAT CGTCCTCAAC CACCCTGGTC AAGTCGGTGC TGGTTATGCC 600

CCAGTCCTTG ACTGCCACAC TGCCCACATT GCTTGCAAGT TCTCCGAGCT 650

CCTCGAGAAG ATCGACCGCC GTACCGGTAA ATCCGTTGAG AACAACCCCA 700

AGTTCATCAA GTCTGGTGAT GCCGCTATCG TCAAGATGGT TCCATCCAAG 750

C 751



2) INFORMATION FOR SEQ ID NO: 434 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1146 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus albidus 

(B) STRAIN: ATCC 66030 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434 



AAGCTCAAGG CCGAGCGAGA GCGAGGTATC ACCATCGACA TCGCCTTGTG 50

GAAGTTCGAG ACCCCCAAGT ACAATGTCAC CGTCATTGAC GCCCCCGGTC 100

ACCGAGACTT CATCAAGAAC ATGATCACCG GTACCTCGCA GGCCGACTGT 150

GCCATCCTCA TCATCGCCTC CGGTATCGGA GAGTTCGAGG CTGGTATCTC 200

CAAGGACGGT CAGACCCGAG AGCACGCCCT TTTGGCCTTC ACCCTCGGTG 250

TCCGACAGCT CATCATTGCC ATCAACAAGA TGGACACCTG CAAGGTTAGT 300

TCGCAGGTCC TGGTCTCTGT ACGAATCTTG CTGACCCCTT TTACAGTGGT 350

CCGAAGACCG ATACAACGAA ATCGTCAAGG AGGCTTCCGG TTTCATCAAG 400

AAGGTCGGAT ACAACCCCAA GACCGTTCCC TTCGTCCCCA TCTCCGGATG 450

GCACGGAGAC AACATGTTGG AGGAGTCCAC CAACATGCCC TGGTACAAGG 500

GATGGCACAA GGAGTCCAAG GCCGGTGTTG TCAAGGGAAA GACCTTGCTC 550

GAGGCCATCG ACGCCATCGA GCCCCCTACC CGACCTTCCG ACAAGCCCTT 600

GCGATTGCCC CTCCAGGATG TCTACAAGAT CGGTGGTATC GGTACGGTGC 650

CCGTCGGTCG AGTCGAGACC GGTGTCATCA AGGCTGGTAT GGTCGTCACC 700

TTCGCCCCTG CCAACGTCAC CACCGAAGTC AAGTCCGTCG AAATGCACCA 750

CGAACAGCTC GCTGAGGGTG TTCCCGGTGA CAACGTCGGT TTCAACGTCA 800

AGAACGTTTC CGTCAAGGAC ATCCGACGAG GAAACGTCTG CTCCGACTCG 850

AAGAACGACC CCGCTATGGA GTCTGCTTCC TTCAACGCTC AGGTCATTGT 900

CTTGAACCAC CCGGGTCAGA TCGGTGCCGG CTACTCCCCC GTTTTGGACT 950

GCCACACCGC TCACATTGCC TGCAAGTTCG CTGAGCTCGT TGAGAAGATC 1000

GACCGACGAA CCGGTAAGGT CATGGAGGCC GCCCCCAAGT TCGTCAAGTC 1050

CGGTGACGCC GCCATCGTCA AGTTGATCCC GTCCAAGCCC ATGTGTGTCG 1100

AGTCCTACTC CGAGTACCCC CCCTTGGGTC GATTCGCCGT CCGAGA 1146


2) INFORMATION FOR SEQ ID NO: 435 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1095 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exophiala jeanselmei 

(B) STRAIN: ATCC 64755 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435

CAAGCTGAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCTTGT 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CTGTCATCGA CGCCCCTGGT 100 

CATCGTGACT TTATCAAGAA CATGATCACT GGTACTTCCC AGGCTGACTG 150 

CGCCATTCTC ATCATTGCCG CCGGTACTGG TGAATTCGAA GCCGGTATCT 200

CCAAGGATGG TCAGACTCGT GAGCACGCTC TGCTCGCCTA CACCCTGGGT 250 

GTCAAGCAGC TCATTGTCGC CATCAACAAG ATGGACACCA CCAAGTGGTC 300

CGAGGATCGT TTCAACGAAA TCATCAAGGA GACTTCCAGC TTCATCAAGA 350

AGGTCGGCTA CAACCCCAAG TCCGTTCCTT TCGTCCCCAT CTCCGGCTTC 400 

AACGGTGACA ACATGATCGA TGTCTCCACC AACTGCCCCT GGTACAAGGG 450

CTGGGAGAAG GAGACCAAGG CTGGCAAGGC CTCTGGCAAG ACTCTCCTCG 500 

AGGCCATCGA CGCCATCGAC CCCCCCACTC GTCCCACCGA CAAGCCTCTC 550 

CGTCTTCCTC TCCAGGATGT CTACAAGATC TCTGGTATCG GAACGGTGCC 600 

CGTCGGTCGT GTTGAGACTG GTGTCATCAA GGCCGGTATG GTCGTTACCT 650 

TCGCTCCTGC CAACGTCACC ACTGAAGTCA AGTCCGTCGA AATGCACCAC 700

GAACAACTCG CCGAGGGTGT TCCAGGTGAC AACGTTGGTT TCAACGTCAA 750 

GAACGTCTCC GTCAAGGAGG TTCGTCGTGG AAACGTCTGC GGTGACTCCA 800 

AGAACGACCC ACCCAAGGGT GCTGATTCCT TCAACGCCCA GGTCATCGTC 850 

TTGAACCACC CTGGTCAAGT CGGTGCTGGC TACGCCCCAG TGTTGGATTG 900 

CCACACTGCC CACATTGCTT GCAAGTTCTC TGAGCTTCTC GAGAAGATTG 950

ACCGCCGTAC CGGTAAATCC ATCGAAAACA ACCCCAAGTT CATCAAGTCT 1000 

GGTGACGCTG CCATCGTCAA GATGGTTCCC AGCAAGCCCA TGTGTGTTGA 1050 

GGCCTTCACT GACTACCCAC CTCTTGGTCG TTTCGCCGTC CGTGA 1095 



2) INFORMATION FOR SEQ ID NO: 436 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1113 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: WSA-212 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436 



AAGCTCAAGG CCGAGCGTGA GCGTGGTATC ACCATCGATA TTGCTCTCTG 50 

GAAGTTCGAG ACTCCTCGCT ACTATGTCAC CGTCATTGGT ATGTTGTCGC 100 

TCATGCTTCA TTCTACTTCT CTTCGTACTA ACACATCACT CAGACGCTCC 150

CGGTCACCGT GATTTCATCA AGAACATGAT CACTGGTACT TCCCAGGCCG 200 

ATTGCGCCAT TCTCATCATT GCCGCCGGTA CTGGTGAGTT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTTCTTG CCTACACCCT 300 

TGGTGTCAAG AACCTCATCG TCGCCATCAA CAAGATGGAC ACCACCAAGT 350 

GGTCTGAGGC CCGTTACCAG GAGATCATCA AGGAGACCTC CTCTTTCATC 400

AAGAAGGTCG GCTACAACCC CAAGGCTGTC GCTTTCGTCC CCATCTCCGG 450

TTTCAACGGT GACAACATGC TTACCCCCTC CACCAACTGC CCCTGGTACA 500

AGGGTTGGGA GCGTGAGATC AAGTCCGGCA AGCTCACTGG CAAGACCCTC 550

CTCGAGGCCA TTGACTCCAT CGAGCCCCCC AAGCGTCCCG TTGACAAGCC 600

CCTTCGTCTT CCCCTTCAGG ATGTCTACAA GATCGGTGGT ATTGGAACGG 650

TTCCCGTCGG CCGTATCGAG ACTGGTGTCA TCAAGCCCGG TATGGTCGTT 700

ACCTTCGCTC CTTCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 750

CCACGAGCAA CTCACTGAGG GCCAGCCCGG TGACAACGTT GGTTTCAACG 800

TGAAGAACGT CTCCGTCAAG GACATCCGAC GTGGTAACGT CGCTGGTGAC 850

TCCAAGAACG ACCCCCCTAT GGGTGCCGCT TCTTTCACCG CCCAGGTCAT 900

CGTCCTCAAC CACCCCGGTC AGGTCGGTGC TGGTTACGCT CCCGTCCTCG 950

ATTGTCACAC TGCCCACATT GCCTGCAAGT TCGCCGAGAT CCAGGAGAAG 1000

ATCGACCGCC GAACCGGTAA GGCTACTGAG GCCGCCCCCA AGTTCATCAA 1050

GTCTGGTGAC TCCGCCATCG TCAAGATGGT TCCCTCCAAG CCCATGTGTG 1100

TTGAGGCTTT CAC 1113



2) INFORMATION FOR SEQ ID NO: 437 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Geotrichum spp.

(B) STRAIN: LEV-4

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437 

GGTCCGAGGA CAGATTCAAC GAGATTGTCA AGGAGACTTC CAACTTCATC 50

AAGAAGGTTG GTTAYAACCC CAAGACTGTT GCTTTCGTCC CCATCTCTGG 100 

TTGGAACGGT GACAACATGA TTGAGCCCTC CACCAACTGC CCCTGGTACA 150 

AGGGATGGCA GAAGGAGACC AAGGCTGGTG TCACTAAGGG TAAGACCCTC 200 

CTTGAGGCCA TCGATGCCAT TGAGCCCCCT GTCAGACCTT CCGACAAGCC 250 

CCTCCGTCTT CCCCTCCAGG ATGTCTACAA GATCGGTGGT ATCGGAACTG 300

TGCCCGTCGG CCGTGTCGAA ACCGGTGTCA TCAAGGCCGG TATGGTCGTC 350 

ACCTTCGCCC CCGCTGGTGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 400

CCACGAGCTC CTCACTGAGG GTCTCCCCGG TGACAACGTT GGTTTCAACG 450

TCAAGAACGT CTCCGTTAAG GATATCAGAC GTGGTAACGT CTGCGGTGAC 500 

TCCAAGAACG ATCCCCCCAA GGCTTGCGCT TCTTTCAACG CCCAGGTCAT 550

TATCTTCAAC CACCCTGGTC AGATCTCTGC TGGATACTCT CCCGTCCTTG 600

ATTGCCACAC CGCCCATATT GCTTGCAAGT TCGACACTTT GATCGAGAAG 650 

ATTGACCGTC GTACTGGTAA GAAGACTGAG GACTCCCCCA AGTTCGTCAA 700 

GGCCGGTGAT GCTGCTATCG TCAAGA 726 

2) INFORMATION FOR SEQ ID NO: 438 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 754 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Histoplasma capsulatum 
(B) STRAIN: G186A5

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438

CACCACCAAG TGGTCCGAGT CCCGTTTCAA CGAAATCATC AAGGAGGTTT 50 

CCAACTTCAT CAAGAAGGTC GGATATAACC CCAAGGCTGT TCCCTTCGTG 100

CCAATCTCTG GTTTCGAGGG TGACAACATG ATTGAACCCT CCCCCAACTG 150 

CACATGGTAC AAGGGCTGGA ACAAGGAGAC TGCCTCTGGC AAGTCTTCTG 200 

GTAAAACCCT TCTCGATGCC ATTGACGCCA TTGAACCCCC AACCCGTCCT 250 

ACCGATAAGC CCCTCCGTCT TCCCCTCCAG GATGTGTACA AAATCTCTGG 300 

TATTGGCACT GTTCCCGTCG GACGTGTTGA GACTGGTGTC ATCAAGCCCG 350

GTATGGTCGT GACTTTCGCT CCCTCCAACG TCACCACTGA AGTCAAGTCC 400 

GTCGAAATGC ACCACCAACA ACTCCAGGCT GGTTACCCTG GCGACAACGT 450 

CGGCTTCAAC GTCAAGAACG TTTCAGTCAA GGAAGTCCGC CGTGGCAACG 500 

TTGCTGGCGA CTCCAAAAAT GATCCCCCTA AGGGCTGCGA ATCCTTCAAT 550 

GCCCAGGTCA TCGTCCTTAA CCACCCCGGC CAGGTTGGCG CTGGTTATGC 600

CCCAGTCCTC GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC 650 

TTATTGAGAA GATCGACCGC CGTACCGGAA AGTCTGTTGA GAACAACCCC 700 

AAGTTCATCA AGTCTGGTGA TGCTGCTATC GTCAAGATGG TTCCCTCCAA 750 

GCCC 754 

2) INFORMATION FOR SEQ ID NO: 439 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 743 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Issatchenkia orientalis 
(B) STRAIN: ATCC 6258

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439 

TGGGATGAAA ACAGATTTGA AGAAATTGTC AAGGAAACCC AAAACTTCAT 50 

CAAGAAGGTT GGTTACAACC CAAAGACTGT TCCATTCGTT CCAATCTCTG 100

GTTGGAATGG TGACAACATG ATTGAAGCAT CCACCAACTG TCCATGGTAC 150 

AAGGGTTGGA CTAAGGAAAC CAAGGCAGGT GTTGTTAAGG GTAAGACCTT 200 

ATTAGAAGCA ATCGATGCTA TTGAACCACC TGTCAGACCA ACCGAAAAGC 250 

CATTAAGATT ACCATTACAA GATGTTTACA AGATTGGTGG TATTGGTACT 300 

GTGCCAGTCG GTAGAGTCGA AACCGGTGTC ATTAAGCCAG GTATGGTTGT 350

CACTTTTGCT CCAGCAGGTG TCACCACCGA AGTCAARTCC GTTGAAATGC 400 

ACCATGAACA ATTAGAACAA GGTGTTCCAG GTGATAACGT TGGTTTCAAC 450 

GTTAAGAACG TCTCTGTCAA GGATATCAAG AGAGGTAACG TTTGTGGTGA 500 

CTCCAAGAAC GACCCACCAA TGGGTGCAGC TTCYTTCAAT GCTCAAGTCA 550 

TTGTCTTGAA CCACCCTGGT CAAATTTCCG CTGGTTACTC TCCAGTCTTG 600

GATTGTCACA CTGCCCACAT TGCATGTAAG TTCGACGAAT TAATCGAAAA 650 

GATTGACAGA AGAACTGGTA AGTCTGTTGA AGACCATCCA AAGTCYGTCA 700 

AGTCTGGTGA TGCAGCTATC GTCAAGATGG TCCCAACCAA GCC 743 


2) INFORMATION FOR SEQ ID NO: 440

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1091 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia furfur 

(B) STRAIN: ATCC 42132 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440 

CAAGCTCAAG GCTGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCTCTGT 50 

GGAAGTTCGA GACCCCTAAG TACCACGTTA CCGTCATTGA CGCTCCTGGT 100 

CACCGTGACT TCATCAAGAA CATGATTACG GGTACCTCGC AGGCTGACTG 150

CGCTATCCTC ATCATTGCCG GTGGTACCGG TGAGTTCGAG GCTGGTATCT 200 

CGAAGGACGG TCAGACCCGT GAGCACGCTC TGCTCGCTTT CACCCTGGGT 250 

GTGCGTCAGC TCATTGTGGC CGTCAACAAG ATGGACACCA CCAAGTACTC 300 

GGAGGACCGC TTCAACGAGA TTGTCCGCGA AGTGTCGAAC TTCATCAAGA 350 

AGGTCGGTTT CAACCCCAAG ACTGTTGCCT TCGTCCCCAT CTCGGGCTGG 400 

CACGGTGACA ACATGATCGA GGCCACCACC AACATGCCTT GGTACAAGGG 450 

CTGGGAGAAG GAGACCAAGT CGGGCAAGGT CACTGGTAAG ACTCTGCTGG 500 

ACGCCATCGA CGCCATCGAG CCCCCGACCC GCCCCACTGA CAAGCCCCTG 550 

CGTCTCCCTC TGCAGGATGT GTACAAGATC GGTGGTATCG GTACTGTCCC 600 

TGTCGGTCGT GTTGAGACCG GTGTGATCAA GCCCGGTATG GTTGTGACCT 650 

TCGCTCCCTC GAACGTCACC ACTGAAGTTA AGTCGGTTGA GATGCACCAC 700 

GAGTCGCTCC CTGAGGGTCT CCCCGGTGAC AACGTTGGTT TCAACGTGAA 750 

GAACGTCTCG GTTAAGGACA TTCGCCGTGG TAACGTTGCC TCGGACTCGA 800 

AGAACGACCC CGCTCAGGAG GCTGCTTCGT TCAACGCGCA GGTCATTGTC 850 

ATGAACCACC CTGGTCAGAT CAGCAACGGT TACTCGCCCG TGCTTGACTG 900 

CCACACTGCG CACATTGCCT GCCGCTTCAA CAACATCCTC CAGAAGATCG 950 

ACCGTCGCTC GGGTAAGGTG CTTGAGGAGA ACCCCAAGTT CATCAAGTCG 1000 

GGTGACGCTG CCATGGTGGA GATGATCCCC ACCAAGCCCA TGTGTGTGGA 1050 

GTCGTTCAAC GAGTACCCCC CTCTGGGTCG TTTCGCTGTG C 1091 



2) INFORMATION FOR SEQ ID NO: 441 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 749 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia pachydermatis 

(B) STRAIN: ATCC 42756 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441 

ACCACCAAGT ACTCGGAGGA CCGCTTCAAC GAGATTATTC GCGAGACTTC 50 
GAACTTCATC AAGAAGGTCG GTTACAACCC GAAGACTGTT GCCTTCGTCC 100

CGATCTCGGG CTGGCACGGT GACAACATGA TTGAGGCGAC CACCAACATG 150

CCGTGGTACA AGGGCTGGGA GAAGGAGACC AAGTCGGGCA AGGCCACTGG 200 

TAAGACCCTT CTGGACGCTA TTGACGCCAT TGAGCCGCCG ACGCGCCCGA 250 

CCGACAAGCC TCTCCGTCTT CCTCTCCAGG ATGTGTACAA GATCGGTGGT 300 

ATCGGTACYG TCCCGGTCGG CCGTGTTGAG ACCGGTGTTA TCAAGCCCGG 350

TATGGTTGTG ACCTTCGCTC CGTCGAACGT CACSACTGAA GTTAAGTCGG 400 

TCGAGATGCA CCACGAGCAG ATCCCTGAGG GTCTTCCGGG TGACAACGTT 450 

GGTTTCAACG TGAAGAACGT GTCGGTCAAG GACATTCGCC GTGGTAACGT 500 

CGCCTCGGAC TCGAAGAACG ACCCGGCTCA GGAGGCTGCC TCGTTCAATG 550 

CTCAGGTCAT TGTGATGAAC CACCCTGGTC AGATCAGCAA CGGTTACTCG 600

CCRGTGCTCG ACTGCCACAC TGCTCACATT GCCTGCCGCT TCAACAACAT 650 

CCTCCAGAAG ATCGACCGTC GTTCGGGTAA GGTTCTYGAA GAGAACCCCA 700 

AGTTCATCAA GTCGGGTGAC GCTGCCATGG TTGAGATGAT CCCGACCAA 749



2) INFORMATION FOR SEQ ID NO: 442 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1150 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malbranchea filamentosa 

(B) STRAIN: ATCC 48174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442 



ACTGAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC CATGTCACCG TCATTGGTAC GTTCGACATG 100 

TTCGACCTTT TGCCTAGTGT CCCCTTCTAA CCACAGTTTA TAGACGCCCC 150

TGGCCATCGT GATTTCGTCA AGAACATGAT CACTGGTACT TCCCAGGCTG 200 

ATTGCGCTAT CCTCATCATT GCTTCCGGCA CTGGTGAATT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTGCTTT CCTTCACCCT 300 

CGGTGTTAGG CAGCTCATTG TCGCCCTCAA CAAGATGGAC ACTGTCAACT 350 

TCGCTGAGGC CCGTTACAAC GAGATTGTCA AGGAAGTCTC CAACTTCATC 400

AAGAAGGTCG GCTACAACCC CAAGGCCGTT CCTTTCGTCC CCATCTCCGG 450 

TTTCGAGGGT GACAACATGA TCGAGGCCTC CACCCGCATT CCTTGGTACA 500 

AGGGCTGGAA CAAGGAGACC GCCAGTGGCA AGAGCACCGG CAAGACCCTY 550 

CTCGAGGCCA TTGATGCCAT CGAACCCCCG GTCCGTCCCA CCGACAAGCC 600 

CCTYCGTCTC CCTCTTCAGG ATGTGTACAA GATCTCCGGT ATTGGCACTG 650

TTCCTGTCGG TCGTGTTGAG ACTGGTGTCA TCAAGCCTGG TATGGTCGTT 700 

ACTTTCGCCC CCGCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 750 

CCACCAGCAG CTCCAGGCCG GTAACCCCGG TGACAACGTC GGCTTCAACG 800 

TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGCAACGT TGCCTCCGAC 850 

TCCAAGAACG ACCCCGCCAA GGGCTGCGAC TCCTTCAACG CCCAGGTCAT 900

CGTCCTTAAC CACCCCGGTC AGGTCGGTGC TGGATACGCT CCCGTCCTCG 950 

ATTGCCACAC TGCCCACATT GCTTGCAAGT TCTCTGAGCT TCTTGAGAAG 1000 

ATCGATCGCC GTACCGGTAA ATCCGTTGAG GACCACCCCA AGTTCATCAA 1050 

GTCTGGTGAT GCCGCTATCG TCAAGATGGT TCCCTCCAAG CCTATGTGCG 1100 

TTGAGGCTTT CACTGACTAC CCTCCCCTTG GTCGTTTCGC CGTCCGTGAC 1150



2) INFORMATION FOR SEQ ID NO: 443

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Metschnikowia pulcherrima 

(B) STRAIN: DSM 70336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443 

GGACAAGTTG AAGGCTGAGA GAGAGAGAGG TATCACCATC GACATTGCCT 50

TGTGGAAGTT CGAGACTCCT AAGTACCACG TCACCGTYAT TGACGCCCCA 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACTT CCCAGGCTGA 150 

CTGTGCYATC TTGATTATCG CYGGTGGTGT TGGTGAGTTC GAGGCTGGTA 200 

TCTCCAAGGA TGGCCAGACC AGAGAGCACG CTTTGTTGGC YTACACCTTG 250 

GGTGTTAGAC ARTTGATTGT TGCCGTCAAC AAGATGGACT CTGTCAAGTG 300 

GGACAAGAAC AGATTTGAGG AGATCATCAA GGAGACCTCT AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCT AAGACYGTGC CATTCGTGCC AATYTCTGGT 400

TGGAACGGTG ACAACATGAT TGAGGCYTCC ACTAACTGCC CATGGTACAA 450 

GGGTTGGGAG AAGGAGACCA AGGCCGGTAA GTCTWCCGGT AAGACCTTGT 500 

TGGAGGCCAT TGACGCCATT GAGCCACCAA CCAGACCTAC CGACAAGGCC 550 

TTGAGATTGC CTTTGCAGGA TGTCTACAAG ATCGGTGGTA TCGGAACGGT 600 

GCCAGTCGGC CGTGTCGAGA CCGGTGTCAT TAAGGCCGGT ATGGTTGTYA 650 

CCTTYGCCCC AGCTGGTGTC ACCACTGAGG TCAAGTCCGT CGAGATGCAC 700 

CACGAGCAGT TGGTCGAGGG TCTTCCAGGT GACAAYGTTG GTTTCAACGT 750 

CAAGAACGTC TCCGTTAAGG AGATCAGAAG AGGTAACGTC TGTGGTGACT 800 

CCAAGCAGGA CCCACCAAAG GGTGCCGCTT CTTTCACCGC YCAGGTTATT 850 

GTGTTGAACC ACCCTGGTCA GATCTCCTCT GGTTACTCTC CAGTGTTGGA 900 

CTGYCACACC GCCCACATTG CCTGTAARTT CGACACCTTG TTGGAGAAGA 950 

TTGACAGAAG AACTGGTAAG TCCTTGGAGT CYGAGCCTAA GTTCGTCAAG 1000 

TCYGGTGACG CCGCCATTGT CAAGATGGTG CCAACCAAGC CAATGTGTGT 1050 

TGAGGCTTTC ACCGACTACC CACCTTTGGG TAGATTCGCC GTCAGAGAC 1099 



2) INFORMATION FOR SEQ ID NO: 444 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1153 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 

(B) STRAIN: ATCC 42570 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444 

CAAGCTCAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CCGTCATTGG TACGTCGACT 100 

CGCGCGAGAC TGGTCGCAAT TTCCACGTCG CTAACGTGCT TGAACAGACG 150 

CTCCCGGCCA CCGTGACTTC ATCAAGAACA TGATCACTGG TACCTCCCAG 200 

GCTGACTGCG CTATCCTCAT TATCGCTGCC GGCACTGGTG AGTTCGAGGC 250

TGGTATCTCC AAGGATGGCC AGACCCGTGA GCACGCTCTG CTCGCCTACA 300

CCCTCGGTGT TAAGCAGCTC ATCGTCGCTA TCAACAAGAT GGACACCACC 350

AAGTGGTCTG AGGCCCGTTT CCAGGAGATC ATCAAGGAGA CCTCCAACTT 400

CATCAAGAAG GTCGGCTACA ACCCCAAGAC CGTCGCTTTC GTCCCCATCT 450

CTGGTTTCCA CGGCGACAAC ATGCTTTCCC CCTCCACCAA CTGCCCCTGG 500

TACAAGGGCT GGGAGAAGGA GACCAAGGCT GGCAAGTCCA CCGGCAAGAC 550

CCTCCTTGAG GCCATCGACT CCATCGAGCC CCCCAAGCGC CCCAGCGACA 600

AGCCCCTCCG CCTTCCCCTT CAGGATGTGT ACAAGATCGG CGGTATCGGC 650

ACAGTCCCTG TCGGCCGTAT CGAGACTGGT GTCATCAAGC CCGGCATGGT 700

CGTGACCTTC GCTCCTTCCA ACGTCACCAC CGAAGTCAAG TCCGTTGAGA 750

TGCACCACGA GCAGCTCTCC GAGGGTGTCC CCGGTGACAA CGTCGGCTTC 800

AACGTCAAGA ACGTCTCCGT CAAGGAGATC CGTCGTGGCA ACGTCGCCGG 850

TGACTCCAAG AACGACCCCC CTCTGGGTGC CGCTTCTTTC GATGCCCAGG 900

TCATCGTCCT CAACCACCCC GGCCAGGTCG GTGCTGGCTA CGCCCCCGTC 950

CTCGACTGCC ACACCGCCCA CATTGCCTGC AAGTTCGCCG AGATCAAGGA 1000

GAAGATCGAC CGCCGTACCG GCAAGTCTGT CGAGTCCGCC CCCAAGTTCA 1050

TCAAGTCTGG CGACTCTGCC ATCGTCAAGA TGATTCCCTC CAAGCCCATG 1100

TGCGTTGAGG CTTTCACCGA CTACCCTCCT CTGGGCCGCT TCGCCGTCCG 1150

TGA 1153



2) INFORMATION FOR SEQ ID NO: 445 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 763 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 
(B) STRAIN: ATCC 32071

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445 

TACCACTAAG TGGTCCGAGA CCCGATTCAA TGAAATTATC AAGGAAGTCA 50 

CCAATTTCAT TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT 100 

CCCATTTCTG GTTTCGAGGG TGACAACATG ATCGAGCCCT CTGCCAACTG 150 

CCCATGGTAC AAGGGCTGGT CCAAGGAGAC TGCTCAGGGC AAGTACTCTG 200 

GCAAGACCCT TCTTGAGGCC ATCGACGCCA TTGAGCCCCC CACCCGTCCT 250 

ACCGATAAAC CTCTCCGTCT TCCCCTCCAG GATGTCTACA AGATCTCCGG 300 

TATTGGCACT GTTCCTGTCG GACGTGTTGA GACTGGAGTC ATCAAGCCCG 350 

GTATGGTCGT GACCTTCGCT CCCGCCAACG TCACCACTGA AGTCAAGTCC 400

GTCGAAATGC ACCACCAGCA GCTTACCGCC GGTAACCCCG GTGACAACGT 450 

CGGCTTCAAC GTCAAGAATG TTTCCGTCAA AGAAGTCCGC CGTGGTAACG 500 

TTGCCGGTGA CTCTAAGAAT GATCCCCCAA AGGGCTGCGA TTCCTTCAAT 550 

GCCCAGGTCA TCGTCCTCAA CCACCCTGGT CAGGTTGGCG CTGGTTATGC 600 

CCCAGTCCTC GACTGCCATA CTGCCCACAT TGCCTGCAAA TTCGCTGAGC 650 

TCCTTGAGAA GATTGATCGA CGAACCGGAA AGTCTGTTGA GAACAACCCC 700 

AAGTTCATCA AGTCCGGTGA TGCTGCTATC GTCAAGATGA TTCCTTCCAA 750 

GCCCATGTGC GTC 763 



2) INFORMATION FOR SEQ ID NO: 446

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1346 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 
(B) STRAIN: ATCC 64101

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446 

AAGCTCAAGG CTGAGCGTGA GCGTGGTATC ACCATCGATA TTGCTCTCTG 50 

GAAGTTCCAG ACTGCCAAGT ACGAGGTTAC CGTCATTGAC GCCCCCGGTC 100

ACCGTGATTT CATCAAGAAC ATGATCACTG GTACCTCCCA GGCCGATTGC 150 

GCTATTCTCA TCATTGCCTC TGGTACTGGT GAATTCGAGG CTGGTATCTC 200 

CAAGGATGGC CAGACTCGTG AGCACGCTCT TTTGGCTTTC ACCCTCGGTG 250 

TCCGTCAGCT CATTGTTGCC CTCAACAAGA TGGACACTTG CAAGTGGTCT 300

CAGGGTGAGT ACTCGTACCT GCGTTTGGCC TTGAATATCT TACTAATGCA 350

CCATAGATCG TTACAACGAA ATTGTCAAGG AGACTTCCAA CTTCATCAAG 400 

AAGGTCGGAT ACAACCCCAA GAACGTTCCT TTCGTTCCTA TCTCCGGTTT 450 

CAACGGTGAC AACATGCTTG AGCCCTCCCC CAACTGCCCC TGGTACAAGG 500 

GTTGGGAGAA GGAGACCAAG GCCGGTAAGG TCACTGGTAA GACCCTCCTC 550 

GAGGCCATCG ACGCCATTGA GCCCCCTACC CGTCCCGCCA ACAAGGTTAG 600

TCCCTCCTCG ACTACTCAAA CCCTCCTCAT AAGTTCAGAT TACTGACTCG 650 

TTCACAGCCC CTCCGTCTTC CCCTCCAGRA TGTCTACAAG ATCGGTGGTA 700 

TTGGAACGGT TCCCGTCGGT CGTGTTGAGA CTGGTACCAT CGTTCCTGGT 750 

ATGGTTGTCA CCTTGTAAGT CACTCTCCTC GCTTATCCTA CCTGAAATCA 800 

TCATGTGCTA ACTTGACACT CAGCGCTCCC GCCAACGTCA CCACTGAAGT 850

CAAGAGTGTT GAAATGCACC ACCAGCAGCT CACTGCCGGT CAGCCCGGTG 900 

ACAACGTTGG TTTCAACGTG AAGAACGTCT CCGTCAAGGA AATCCGTCGT 950 

GGTAACGTTG CTGGTGACAG CAAGAACGAC CCCCCTGCCG GTGCTGCCTC 1000 

CTTCAACGCC CAGGTCATCG TCCTCAACCA CCCCGGTCAG GTCGGTGCTG 1050 

GTTACGCCCC AGTCCTCGAT TGCCACACTG CCCACATTGC TTGCAAGTTC 1100

GCTGAGCTCC TCGAGAAGAT TGACCGTCGT ACCGGAAAGT CTGTTGAGGA 1150 

CCACCCCAAG TTCATCAAGT CCGGTGACGC TGCCATCGTC AAGATGATTC 1200

CTTCCAAGCC CATGTGTGTT GAGGCTTTCA CCGAGTACCC TCCTCTCGGT 1250 

CGTTTCGCCG TTCGCGAGTA AGTTTTATCT CCGTTGTCTA TTTTCCATCC 1300

TTCCCTTCTC CTCCGTCTTC CATATATACT TTTTCAGTTA TATGTG 1346



2) INFORMATION FOR SEQ ID NO: 447 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Pichia anomala

(B) STRAIN: ATCC 18205 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447 

AAGTTAAAAG CTGAACGTGA AAGAGGTATC ACTATTGATA TTGCTTTATG 50

GAAATTCGAA ACTCCAAAAT ACCATGTTAC CGTTATTGAT GCTCCAGGTC 100

ACAGAGATTT CATCAAAAAT ATGATTACTG GTACTTCCCA AGCTGATTGT 150

GCTATYTTAA TTATTGCCGG TGGTATTGGT GAATTCGAAG CTGGTATCTC 200

AAAAGATGGT CAAACCAGAG AACACGCTTT NTTAGCTTAC ACCTTAGGTG 250

TTAAACAATT GATTGTTGCT ATCAACAAGA TGGATTCCGT TAAATGGGAT 300

GAATCTCGTT TCGAAGAAAT TGTCAAGGAA ACYTCAAACT TTATCAAGAA 350

AGTTGGTTAC AACCCAAAAA CTGTTCCATT CGTTCCAATC TCAGGTTGGA 400

ATGGTGATAA CATGATTGAA CCATCAWCTA ACTGTCCATG GTACAAAGGT 450

TGGAAAAAAG AAACCAAAGC TGGTGAAGCT AAAGGTAAAA CTTTATTAGA 500

AGCCATTGAT GCTATTGACC CACCATCAAG ACCAACTGAT AAACCATTAC 550

GTTTACCATT ACAAGATGTT TACAARATTG GTGGTATTGG TACTGTGCCA 600

GTCGGTAGAG TTGAAACCGG TGTTATCAAA CCAGGTATGG TTGTTACCTT 650

TGCCCCAGCT GGTGTTACCA CTGAAGTCAA ATCTGTTGAA ATGCATCATG 700

AACAATTGAC TGAAGGTTTA CCAGGTGACA ATGTTGGTTT CAACGTTAAG 750

AATGTTTCTG TTAAAGAAAT CCGTCGTGGT AACGTCTGTG GTGACTCTAA 800

AAACGATCCA CCAAAAGCTG CTGAATCATT CAATGCTCAA GTTATTGTCT 850

TAAACCATCC AGGTCAAATC TCTGCTGGTT ACTCTCCAGT TTTAGATTGT 900

CACACTGCTC ACATTGCTTG TAAATTTGAC ACTTTAATTG AAAAAATTGA 950

CAGACGTACT GGTAAGAAAT TAGAAGAAGC TCCAAAATTC ATCAAATCAG 1000

GTGATGCTGC TATTGTTAAA TTTGTTCCAT CAAAACCATT ATCAGTTGAA 1050

GCTTTCACTG ACTACCCACC ATTAGGTCGT TTCGCTGTCA GAGA 1094



2) INFORMATION FOR SEQ ID NO: 448 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1100 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 2149 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448 



CTTAGATAAG TTAAAAGCTG AACGTGAAAG AGGTATCACT ATTGATATTG 50 

CTTTATGGAA ATTCGAAACT CCAAAATACC ATGTTACCGT TATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAAAATATG ATTACTGGTA CTTCCCAAGC 150 

TGATTGTGCT ATMTTAATTA TTGCCGGTGG TATTGGTGAA TTCGAAGCTG 200 

GTATCTCAAA AGATGGTCAA ACCAGAGAAC ACGCTTTRTT AGCTTACACC 250 

TTAGGTGTTA AACAATTGAT TGTTGCTATC AACAAGATGG ATTCCGTTAA 300 

ATGGGATGAA TCTCGTTTCG AAGAAATTGT CAAGGAAACY TCAAACTTTA 350 

TCAAGAAAGT TGGTACAACC CAAAAACTGT TCCATTCGTT CCAATCTCAG 400 

GTTGGAATGG TGATAACATG ATTGAACCAT CAACTAACTG TCCATGGTAC 450 

AAAGGTTGGA AAAAAGAAAC CAAAGCTGGT GAAGCTAAAG GTAAAACTTT 500 

ATTAGAAGCC ATTGATGCTA TTGATCCACC ATCAAGACCA ACTGATAAAC 550 

CATTACGTTT ACCATTACAA GATGTTTACA ARATTGGTGG TATTGGTACT 600 

GTGCCAGTCG GTAGAGTTGA AACCGGTGTT ATCAAACCAG GTATGGTTGT 650 

TACCTTTGCC CCAGCTGGTG TTACCACTGA AGTCAAATCT GTTGAAATGC 700 

ATCATGAACA ATTGACTGAA GGTTTACCAG GTGACAATGT TGGTTTCAAC 750 

GTTAAGAATG TTTCTGTTAA AGAAATCCGT CGTGGTAACG TCTGTGGTGA 800 

CTCTAAAAAC GATCCACCAA AAGCTGCTGA ATCATTCAAT GCTCAAGTTA 850 

TTGTCTTAAA CCATCCAGGT CAAATCTCTG CTGGTTACTC TCCAGTTTTA 900 

GATTGTCACA CTGCTCACAT TGCTTGTAAA TTTGACACTT TAATTGAAAA 950 

AATTGACAGA CGTACTGGTA AGAAATTAGA AGAAGCTCCA AAATTCATCA 1000 

AATCAGGTGA TGCTGCTATT GTTAAATTTG TTCCATCAAA ACCATTATCA 1050 

GTTGAAGCTT TCACTGACTA CCCACCATTA GGTCGTTTCG CTGTCAGAGA 1100 



2) INFORMATION FOR SEQ ID NO: 449 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1201 bases 
(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudallescheria boydii 

(B) STRAIN: ATCC 44331 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449 



CAAGCTCAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT 50 

GGAAGTTCGA GACCCCCAAG TACCAGGTCA CCGTCATTGG TATGTCTTTG 100 

TGCTTTTTGT GCTTTTTGTG CCTGTGCCTC GCACAATTCC AGCCCTCGAT 150 

AATTATGAAC CTCGTACTAA TATGTCGTTC TCCCACTACC CACAGACGCC 200 

CCCGGCCATC GTGATTTCAT CAAGAACATG ATTACTGGTA CCTCCCAGGC 250 

TGATTGCGCC ATTCTCATCA TTGCCTCCGG TACTGGTGAG TTCGAGGCTG 300 

GCATCTCCAA GGATGGCCAG ACCCGTGAGC ACGCTCTTCT CGCCTTCACC 350 

CTCGGTGTCA AGAACCTCAT TGTTGCCATC AACAAGATGG ACACCAACAA 400 

CTGGTCCGAG GACCGATACA AGGAGATCAT CAAGGAGACC TCCAACTTCA 450 

TCAAGAAGGT CGGCTACAAC CCCAAGGCCG TTCCTTTCGT CCCCATCTCC 500 

GGTTTCCACG GAGACAACAT GCTTACCCCC TCCACCAACT GCCCCTGGTA 550 

CAAGGGTTGG GTCCGTGAGG TCAAGGGTAA CACCCTTACC GGCAAGACCC 600 

TTCTCGAGGC CATCGACTYC ATCGAGCCCC CCAAGCGTCC CACCGAGAAG 650 

CCCCTCCGTC TTCCCCTTCA GGACGTCTAC AAGATCGGTG GTATTGGCAC 700 

TGTGCCCGTC GGCCGTATCG AGACCGGTAT CCTCAAGCCC GGTATGGTCG 750 

TCACCTTCGC TCCCTCCAAC GTCACCACTG AAGTCAAGTC CGTCGAGATG 800 

CACCACGAGC AGCTTACCGA GGGTGTCCCC GGTGACAACG TTGGTTTCAA 850 

CGTGAAGAAC GTCTCCGTCA AGGATATCCG CCGTGGCAAC GTCTGCGGTG 900 

ACTCCAAGAA CGACCCCCCC GCTGCTGCCG CCTCTTTCCA GGCCCAGGTC 950 

ATTGTCCTCA ACCACCCCGG CCAGATCGGT GCTGGTTACG CTCCCGTTCT 1000 

TGACTGCCAC ACTGCCCACA TTGCTTGCAA GTTCGCCGAG CTCCTTGAGA 1050 

AGATCGACCG CCGTACCGGT AAGTCGGTCG AGAACAACCC CAAGTTCGTC 1100 

AAGTCTGGTG ATGCCGCCAT CGTCAAGATG GTTCCCTCCA AGCCCATGTG 1150 

TGTTGAGTCC TTCACCGAGT ACCCCCCTCT CGGTCGTTTC GCCGTCCGTG 1200 

A 1201 



2) INFORMATION FOR SEQ ID NO: 450 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Rhizopus oryzae 

(B) STRAIN: ATCC 56015 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450 

CAAGCTTAAG GCTGAACGTG AACGTGGTAT CACCATCGAT ATTGCTCTCT 50 

GGAAGTTCGA AACCCCCAAG TACCAAATTA CCGTTATTGA TGCTCCCGGT 100 

CACCGTGATT TCATCAAGAA CATGATTACC GGTACTTCTC AAGCCGATTG 150 

TGCTATTCTT ATCATTGCTG GTGGTACTGG TGAATTCGAA GCTGGTATCT 200 

CCAAGGATGG TCAAACCCGT GAACACGCCC TTTTGGCTTT CACTCTCGGT 250 

GTCCGTCAAT TGATTGTTGC TGTCAACAAG ATGGATACCA CCAAGTGGTC 300 

CGAAGCTCGT TTCAACGAAA TYGTCAAGGA AGTTTCTTCC TTCATCAAGA 350 

AGATTGGTTA CAACCCCAAG TCTGTTCCCT TCGTCCCCAT CTCTGGTTGG 400 

CACGGTGACA ACATGTTGGA AGAATCTACC AACATGCCCT GGTACAAGGG 450 

ATGGAACAAG GAAACCAAGG CTGGTGCCAA GTCTGGTAAG ACCCTCTTGG 500 

ATGCCATTGA CAACATTGAC CCTCCTACCC GTCCTGTTGA CAAGCCTCTC 550 

CGTCTTCCTC TTCAAGATGT TTACAAGATT GGTGGTATCG GTACTGTCCC 600 

CGTCGGTCGT GTCGAAACTG GTGTCATCAA GGCTGGTATG GTTGTCACCT 650 

TCGCTCCTGC TGCTGTCACC ACTGAAGTTA AGTCCGTCGA AATGCACCAC 700 

GAAACCCTCA CTGAAGGTCT CCCCGGTGAC AACGTCGGTT TCAACGTCAA 750 

GAACGTCTCC GTCAAGGATA TCCGTCGTGG TAACGTCTGT TCTGACTCCA 800 

AGAACGACCC CGCCAAGGAA GCCGGTTCCT TCACCGCTCA AGTCATTATC 850 

TTGAACCACC CTGGTCAAAT TGGTGCTGGT TACGCTCCYG TTTTGGATTG 900 

TCACACTGCT CACATTGCCT GTAAGTTCGC TGAATTGATC GAAAAGATTG 950 

ACAGACGTTC CGGTAAGTCC TTGGAAGCTA CTCCCAAGTT CGTCAAGTCT 1000 

GGTGACTCTG CCATCGTCAA GATGATCCCC TCCAAGCCCA TGTGTGTTGA 1050 

AGCTTACACT GACTACCCTC CTCTCGGTCG TTTCGCTGTT CGTGA 1095 






2) INFORMATION FOR SEQ ID NO: 451 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1292 bases 
(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula minuta 

(B) STRAIN: ATCC 10658 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451 

GCTGAAGGCC GAGCGAGAGC GTGGTATCAC CATCGATATC GCTCTATGGA 50 

AGTTCGAGAC CCCCAAGTAC AACGTCACCG TCATTGACGC TCCAGGACAT 100 

CGTGATTTCA TCAAGAGTGA GTTAACCATA ACATCAAACA GTGTTGCAAA 150 

CATCAGCTAA TGCATGTTAT GCGTCCAGAC ATGATTACTG GTACTTCCCA 200 

GGCCGATTGC GCTATTCTCA TCATCGCCAC CGGTGTTGGT GAGTTCGAGG 250 

CTGGTATCTC CAAGGATGGC CAGACCCGAG AGCACGCCCT TCTCGCCTTC 300 

ACCCTCGGTG TCAGACAGCT CATCGTTGCC TTGAACAAGA TGGACTCGGT 350 

CAAGGTAGGC TAACTTCACA ACGTCGGCTT CCCATCATTC ATTCACTTAC 400 

CTGTCTTGTC TTCCACCCTC CAGTTCTCCG AGTCCCGATA CGATGAAATC 450 

GTCAAGGAGA CATCCGGTTT CATCAAGAAG GTCGGATTCG ACCCCAAGGG 500 

TGTTCCCTTC GTCCCCATCT CAGGATGGCA CGGAGACAAC ATGATCGAGG 550 

AGTCCACCAA CATGCCTTGG TACAAGGGAT GGAAGAAGAC CACCAAGACC 600 

GGCGAGTACA AGGGAAAGAC CCTGCTCGAG GCCATCGACT CCATCGAGCC 650 

CCCCACCCGT CCTACCGACA AGCCTCTCCG ACTTCCCCTC CAGGATGTCT 700 

ACAAGATTGG TGGTATCGGA ACAGTGCCAG TCGGACGAGT CGAGACTGGT 750 

ACCATCAAGG CTGGTATGGT CGTCACCTTC GCTCCTTCAG CTGTCACCAC 800 

CGAAGTCAAG TCTGTTGAGA TGCACCACGA GCAGCTCGAG GCTGGTCTTC 850 

CAGGTGACAA CGTCGGATTC AACATCAAGA ACGTTTCAGT CAAGGATATC 900 

CGAAGAGGAA ACGTCTGCGG TGACTCCAAG AACGATCCCC CCAAGGAGGC 950 

TGCTTCCTTC ACCGCCCAGG TCATTGTCCT CAACCACCCC GGTCAAATCG 1000 

GTAACGGATA CTCTCCAGTT CTCGATTGCC ACACTGGTGA GTCATTCTTC 1050 

CATATTAGTT TGAACTCTTT TGAACAATAC TAACGTGAAT CATTATACTT 1100 

TTCAGCCCAC ATTGCATGCA AGTTCGACAC CCTCCTAGAG AAGATTGACC 1150 

GACGATCCGG AAAGTCCATC GAAGATACCC CCAAGTTCGT CAAGTCTGGT 1200 

GACGCCGCCA TCGTCAAGAT GGTCCCCACC AAGCCAATGT GCGTTGAGGC 1250 

TTTCACCGAC TACCCACCTC TTGGACGATT CGCCGTCCGT GA 1292 






2) INFORMATION FOR SEQ ID NO: 452 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1289 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporobolomyces salmonicolor 

(B) STRAIN: ATCC 32311 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452 

AAGCTCAAGG CCGAGCGTGA GCGTGGTATC ACCATCGATA TCGCTCTCTG 50 

GAAGTTCGAG ACCCCCAAGG TGCGTTCTCA CCCCGGCTGA GGAGCACGCA 100 

CGCGAGGGCT CACGCTGCGC CTCTTACAGT ACATGATCAC CGTCATCGAC 150 

GCCCCGGGTC ACCGTGACTT CATCAAGAAC ATGATCACCG GTACCTCGCA 200 

GGCCGACTGC GCCATCCTCA TCATCGCCGC CGGTACCGGT GAGTTCGAGG 250 

CTGGTATCTC GAAGGACGGC CAGACCCGCG AGCACGCCCT CCTCGCCTTC 300 

ACCCTCGGTG TCCGTCAGCT CATCGTCGCC ATCAACAAGA TGGACACGAC 350 

CAAGTACTCG GAGGCCCGTT TCGAGGAGAT CATCAAGGAG ACCTCCAACT 400 

TCATCAAGAA GGTCGGCTTC AACCCCAAGA ACGTCCCCTT CGTCCCCATC 450 

TCGGGATGGC ACGGTGACAA CATGATTGAG GAGACCGCCA ACATGCCCTG 500 

GTACAAGGGA TGGAAGAAGG AGACCAAGGC CGGTGAGGTC AAGGGCAAGA 550 

CCCTCCTCGA CGCCATCGAC GCGATCGAGC CCCCTTCGCG CCCTACCGAC 600 

AAGCCCCTCC GTCTTCCCCT CCAGGTTCGT TTCCCTGCTC GCGGTTTACG 650 

CTGCTACTTC GAGCTGACCC GCGAGCTCTG CCCGAACAGG ATGTCTACAA 700 

GATCGGTGGT ATCGGCACAG TCCCCGTCGG CCGTGTCGAG ACCGGCACGA 750 

TCAAGGCCGG TATGGTCGTC GTCTTCGCCC CGGCCAACGT CACCACTGAG 800 

GTCAAGTCGG TCGAGATGCA CCACGAGCAG CTCGAGGCTG GTCTCCCGGG 850 

AGACAACGTC GGCTTCAAGT GCGTCTCATC ATGTTTTTGC TTCGCTCGGC 900 

CATTTTTTCA GTCCTGACCC CGTTTTGCCC CTCGACAGCG TCAAGAACGT 950 

TTCCGTTAAG GACATCCGTC GCGGTAACGT CTGCGGTGAC TCGAAGAACG 1000 

ACCCCCCCAA GGAGGCCGCT TCCTTCAAGG CCCAGGTCAT CGTCATGAAC 1050 

CACCCCGGTC AGATCGGCAA CGGTTACGCT CCCGTTCTCG ACTGCCACAC 1100 

CGCCCACATT GCCTGCAAGT TCGACACCCT CCTCGAGAAG ATCGACCGTC 1150 

GCTCGGGCAA GTCGATTGAG GACCTCCCCA AGTTCGTCAA GTCGGGTGAC 1200 

GCCGCCATCG TCAAGATGGT TCCCTCCAAG CCGATGTGTG TCGAGTCGTT 1250 

CGCCGAGTAC CCCCCTCTCG GACGTTTCGC CGTCCGTGA 1289 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

2) INFORMATION FOR SEQ ID NO: 453 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1070 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: WSA-148 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453 

GTGAGCGCGG TATCACCATC GATATTGCTC TGTGGAAGTT CGAGACCCCC 50 

AAGTACTACG TCACCGTCAT TGACGCCCCC GGTCATCGCG ATTTCATCAA 100 

GAACATGATC ACTGGTACCT CGCAGGCCGA CTGCGCCATT CTCATCATTG 150 

CCGCTGGTAC TGGTGAGTTC GAGGCTGGTA TCTCCAAGGA TGGCCAGACT 200 

CGTGAGCACG CTCTGCTCGC CTACACCCTG GGTGTGCGGC AGCTGATCGT 250 

CGCCATCAAC AAGATGGACA CGGCCAAGTG GGCTGAGGCT CGTTACCAGG 300 

AGATCATCAA GGAGACCTCC AACTTCATCA AGAAGGTCGG CTACAACCCC 350 

AAGACTGTTG CCTTCGTCCC CATCTCGGGC TTCCACGGCG ACAACATGCT 400 

TACTCCCTCG ACCAACTGCC CCTGGTACAA GGGCTGGGAG AAGGAGGGCA 450 

AGAGCGGCAA GGTTACCGGT AAGACTCTGC TGGACGCCAT TGACGCCGTC 500 

GAGCCCCCCA AGCGCCCCAC GGACAAGCCC CTGCGTCTGC CCCTCCAGGA 550 

TGTCTACAAG ATCGGCGGTA TCGGCACTGT CCCTGTCGGC CGTATCGAGA 600 

CTGGTGTCCT GAAGCCCGGC ATGGTCGTCA CCTTTGCCCC GTCCAACGTC 650 

ACCACTGAAG TCAAGTCCGT CGAGATGCAC CACGAGCAGC TTGTTGAGGG 700 

TGTTCCCGGC GACAACGTCG GCTTCAACGT CAAGAACGTC TCCGTCAAGG 750 

AGATCCGTCG TGGCAACGTT GCCGGTGACT CCAAGAACGA CCCCCCCTCG 800 

GGCGCCGCCA CCTTCAACGC CCAGGTCATT GTCCTGAACC ACCCCGGCCA 850 

GGTCGGCAAC GGCTACGCCC CGGTTCTGGA CTGCCACACC GCCCACATTG 900 

CCTGCAAGTT CACCGAGATC CTTGAGAAGA TCGACCGCCG TACCGGCAAG 950 

TCGGTTGAGA ACAACCCCAA GTTCATCAAG TCGGGTGACG CCGCCATTGT 1000 

CAAGCTGACG CCCTCGAAGC CCATGTGCGT TGAGGCCTTC ACTGACTACC 1050 

CCCCTCTGGG CCGTTTCGCC 1070 






2) INFORMATION FOR SEQ ID NO: 454 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1092 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stephanoascus ciferrii 

(B) STRAIN: ATCC 52550 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454 



CTTAAGTCTG AGCGTGAGAG AGGTATCACC ATCGATATTG CTCTCTGGAA 50 

ATTCGAGACT CCTAAGTACA ACGTTACCGT CATTGATGCT CCAGGTCACA 100 

GAGATTTCAT CAAGAACATG ATTACTGGTA CCTCCCAGGC CGATCTTGCC 150 

ATCCTTATCA TTGCTGGTGG TGTCGGTGAG TTCGAGGCTG GTATCTCCAA 200 

GGACGGTCAG ACCAGAGAGC ACGCTCTTCT TTCTTTCACC CTTGGTGTCA 250 

GAAACATGAT TGTTGCTGTC AACAAGATGG ACTCCGTCAA GTGGTCTGAG 300 

GATCGTTTCA ACGAAATTGT CAAGGAGACC TCCAACTTCG TCAAGAAGGT 350 

TGGTTACAAC CCTAAGAATA TTGCTTTCGT TCCTATCTCC GGTTGGAACG 400 

GTGACAATAT GATTGAGCCA TCCACCAACT GCCCATGGTA CAAGGGTTGG 450 

GAGCGTGAGA CCAAGAACGG TACTGCCAAG GGTAAGACCA TCTTGGAGGC 500 

CATTGACTCT ATGGAGCCAC CTTCCAGACC AGTTGACAAG CCTCTCCGTC 550 

TTCCTCTTCA GGACGTTTAC AAGATTGGTG GTATTGGTAC GGTGCCAGTT 600 

GGTCGTGTTG AGACTGGTGT TATCAAGCCA GGTATGGTTG TTACCTTTGC 650 

CCCAGCTGGT GTCACCACTG AAGTCAAGTC TGTCGAGATG CACCACGAAC 700 

AGATCCCAGA AGGTACCCCA GGTGACAACG TTGGTTTCAA CGTCAAGAAC 750 

GTCTCCGTCA AGGAAATCAG ACGTGGTAAC GTTACTGGTG ACTCCAAGAA 800 

CGACCCACCA AAGGGCTGCG ACTCTTTCAA CGCTCAGGTC ATCATCTTCA 850 

ACCACCCTGG TCAGATCTCT GCTGGTTACG CTCCAGTTTT GGACTGCCAC 900 

ACTGCTCACA TTGCTTGCAA GTTTGAGGAG CTCATTGAGA AGATTGACAG 950 

ACGTTCCGGT AAGAAGGTCG AAGACTCTCC TAAGTTCGTC AAGGCCGGTG 1000 

ATGCCGCCAT TGTCAAGATG GTTCCATCCA AGCCAATGTG TGTTGAAACC 1050 

TTCACTGAGT ACCCACCTCT TGGTCGTTTC GCCGTCCGTG AC 1092 



2) INFORMATION FOR SEQ ID NO: 455 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1149 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 
(B) STRAIN: WSA-225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455 

GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT GGAAGTTCGA 50 

GACCCCCAAG TACAATGTCA CCGTCATTGG TATGTTTTCT CTTTACCTTT 100 

CCCCTCCATC GTCTTGCTGT GCCATAACTA ACGAGAGTAG ACGCCCCCGG 150 

TCACCGTGAC TTCATCAAGA ACATGATCAC TGGTACCTCC CAGGCCGACT 200 

GTGCTATTCT CATCATTGCT GCCGGTACTG GTGAGTTCGA GGCTGGTATC 250 

TCCAAGGATG GCCAGACCCG TGAGCACGCT CTGCTCGCCT TCACCCTTGG 300 

TGTCAAGCAG CTCATCGTTG CCATCAACAA GATGGACACC ACCAACTGGT 350 

CCGAGGACCG TTTCAAGGAA ATCATCAAGG AAGTCACCAA CTTCATCAAG 400 

AAGGTTGGCT ACGACCCCAA GGGTGTTCCA TTCGTTCCAA TCTCTGGTTT 450 

CAACGGTGAC AACATGATTG AGGCCTCCAC CAACTGCCCA TGGTACAAGG 500 

GATGGAACAA GGAGACTAAG GCCGGTGGTG CCAAGACTGG CAAGACCCTC 550 

CTCGAGGCCA TCGATGCCAT CGACATGCCA ACCCGTCCTA CCGACAAGCC 600 

CCTCCGTCTC CCACTCCAGG ATGTCTACAA GATCTCTGGT ATCGGAACTG 650 

TACCAGTCGG TCGTGTTGAG ACCGGTATCA TCAAGCCCGG TATGGTCGTC 700 

ACCTTCGCCC CTGCCAACGT CACCACTGAA GTCAAGTCCG TCGAAATGCA 750 

CCACCAGCAG CTTCAGCAGG GTGTCCCCGG TGACAACGTC GGCTTCAACG 800 

TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGTAACGT TGCCGGTGAC 850 

TCCAAGAACG ACCCACCATC CGGCTGTGCC TCCTTCAACG CCCAGGTCAT 900 

CGTCCTCAAC CACCCCGGCC AGATCGGTGC TGGTTACGCC CCAGTCCTCG 950 

ACTGCCACAC TGCTCACATT GCTTGCAAGT TCGCTGAGCT CCTCGAGAAG 1000 

ATTGACCGCC GTACCGGTAA ATCCGTCGAA GCCAACCCCA AGTTCGTCAA 1050 

GTCTGGTGAT GCCGCTATCG CCAAGATGGT TCCCTCCAAG CCTATGTGCG 1100 

TTGAGGCTTT CACTGACTAC CCCCCACTTG GTCGTTTCGC CGTCCGTGA 1149 



2) INFORMATION FOR SEQ ID NO: 456 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichosporon cutaneum 

(B) STRAIN: ATCC 62965 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456 

TCTTGACAAG CTTAAAGCTG AACGTGAACG TGGTATCACC ATTGATATCG 50 

CTCTCTGGAA GTTCGAAACT CCTAAGTACT ACGTTACTGT TATTGATGCT 100 

CCAGGTCACC GTGATTTCAT CAAGAACATG ATTACTGGTA CTTCCCAAGC 150 

CGACTGCGCC ATTCTTATCA TTGCTGCCGG TGTCGGTGAA TTCGAAGCTG 200 

GTATCTCCAA GGAAGGTCAA ACCAGAGAAC ACGCTCTTCT CGCTTTCACC 250 

CTTGGTGTCA GACAACTTAT CATTGCCATC AACAAGATGG ACTCTGTCAA 300 

GTGGGACCAA AAGAGATACG AAGAAATCGT CAAGGAGGCT TCCAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCCAAGTCTG TTCCATTCGT TCCTATCTCT 400 

GGTTGGAACG GTGACAACAT GTTGGAACCT ACCACCAACG CCCCATGGTA 450 

CAAGGGATGG ACCAAGGAAA CCAAGGCTGG TGCCACTAAG GGTATGACTC 500 

TTATTGAAGC CATTGACGCC ATTGAACCAC CAGTAAGACC ATCCGACAAG 550 

CCACTCCGTC TCCCACTCCA AGATGTTTAC AAGATTGGTG GTATCGGAAC 600 

TGTGCCAGTC GGCCGTGTCG AAACCGGTAT CATCAAGGCC GGTATGGTCG 650 

TCACCTTTGC TCCACCAATG GTCACAACTG AAGTTAAGTC CGTTGAAATG 700 

CACCACGAAC AACTTGCTCA AGGTAACCCA GGTGACAACG TTGGTTTCAA 750 

CGTCAAGAAC GTTTCCGTTA AGGAAATCAG ACGTGGTAAC GTCTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGCTGCG AATCTTTCAA CGCTCAAGTT 850 

ATCGTCTTGA ACCACCCTGG TCAAATCTCT GCTGGTTACT CTCCAGTTCT 900 

CGATTGCCAC ACTGCCCACA TTGCCTGCAG ATTCGACGAA CTCCTTGAAA 950 

AGATCGACCG TCGTTCCGGT AAGAAGATTG AAGACTCTCC AAAGTTTGTC 1000 

AAGTCTGGTG ATGCCGCTAT CGTCAAGATG ATCCCAACCA AGCCAATGTG 1050 

CGTTGAAACC TTCACTGAAT ACCCACCACT TGGTCGTTTC GCCGTCCGTG 1100 

A 1101 



2) INFORMATION FOR SEQ ID NO: 457 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1085 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Wangiella dermatitidis 

(B) STRAIN: WSA-229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457 

GTTGAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC TATGTCACCG TCATCGACGC CCCGGGTCAT 100 

CGTGACTTTA TCAAGAACAT GATCACTGGT ACCTCGCAGG CCGACTGCGC 150 

CATCTTGATC ATTGCCGCCG GTACCGGTGA ATTCGAAGCC GGTATCTCCA 200 

AGGATGGTCA GACCCGTGAG CACGCTCTGC TCGCCTACAC CTTGGGTGTC 250 

AAGCAGCTCA TCGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCCGA 300 

GGAGCGTTTC AACGAAATCA TCAAGGAGAC TTCCAACTTC ATCAAGAAGG 350 

TCGGCTACAA CCCCAAGGCC GTTCCTTTCG TCCCCATCTC CGGCTTCAAC 400 

GGTGACAACA TGATTGAGGT CTCCACCAAC TGCCCGTGGT ACAAGGGATG 450 

GGAGAAGGAG TCCAAGGCTG GCAAGGCCAC CGGCAAGACC CTCCTCGAGG 500 

CCATTGACGC CATCGACCCA CCCACCCGTC CCACCGACAA GCCTCTCCGT 550 

CTCCCTCTCC AGGATGTCTA CAAGATCTCT GGTATCGGAA CGGTTCCTGT 600 

CGGTCGTGTC GAGACCGGTA CCATCAAGGC CGGTATGGTC GTCACCTTCG 650 

CTCCGGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAAAT GCACCACGAG 700 

CAGCTCGCCG AGGGTCTGCC AGGTGACAAC GTTGGCTTCA ACGTCAAGAA 750 

CGTCTCCGTC AAGGAGGTTC GTCGTGGTAA CGTTGCCGGT GACTCCAAGA 800 

ACGACCCGCC CAAGGGTGCC GAGTCCTTCA ACGCCCAGGT CATTGTCCTC 850 

AACCACCCTG GTCAGATCGG TGCCGGCTAC GCTCCAGTCT TGGATTGCCA 900 

CACTGCCCAC ATTGCTTGCA AGTTCGCCGA GTTGCTCGAG AAGATCGACC 950 

GTCGTACCGG AAAGTCCATC GAGAACAACC CCAAGTTCAT CAAGTCTGGT 1000 

GATGCTGCCA TCGTCAAGAT GATTCCCAGC AAGCCCATGT GTGTCGAGGC 1050 

TTTCACCGAC TATCCTCCTC TGGGTCGTTT CGCTG 1085 




2) INFORMATION FOR SEQ ID NO: 458 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 492 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
(B) STRAIN: DAL-95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458 

TGTCTTCATC CGGAATTGAT TGTGAGTCGT TCCACATGCT CACCTAGTTT 50 

TCGCTCGATC TTTTCACTAA CGCAAACCAT GTAGAACAAC ATTGCCAAGG 100 

CCCACGGTGG TTACTCCGTC TTCACTGGTG TTGGTGAGCG TACTCGTGAG 150 

GGTAACGATC TGTACCACGA AATGCAGGAG ACTGGTGTCA TTCAGCTCGA 200 

GGGTGAATCC AAGGTCGCAC TGGTGTTCGG ACAGATGAAC GAGCCCCCCG 250 

GTGCCCGTGC CCGTGTCGCC CTTACCGGTC TGACCATTGC CGAGTACTTC 300 

CGTGACGAGG AGGGTCAGGA CGTGCTGCTC TTCATTGACA ACATTTTCCG 350 

TTTCACCCAG GCCGGTTCTG AGGTGTCTGC CCTTCTCGGT CGTATCCCCT 400 

CTGCCGTCGG TTACCAGCCC ACCCTGGCCG TCGACATGGG TGGTATGCAG 450 

GAGCGTATCA CCACCACCAA GAAGGGTTCT ATTACCTCCG TC 492 

2) INFORMATION FOR SEQ ID NO: 459 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1154 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 

(B) STRAIN: ATCC 10663 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459 

GTCCGTGGTC AAGAAGTTAT TGACACTGGT GCCCCAATTA CCATTCCTGT 50 

TGGTCGTGGT ACTCTTGGTA GAATTATCAA CGTCATTGGT GAACCAATTG 100 

ACGAACGTGG TCCTATCAAG GCTTCTAAGT ATGCTCCTAT CCATACTGAA 150 

CCACCAACCT TTGCTGAACA ATCTACTTCT GCTGAAGTTC TTGAAACTGG 200 

TATCAAGGTT GTCGATCTTC TTGCTCCTTA CGCCCGTGGT GGTAAGATTG 250 

GTCTTTTCGG TGGTGCTGGT GTCGGTAAGA CTGTCTTCAT TCAAGAACTT 300 

ATTAACAACA TTGCCAAGGC TCACGGTGGT TTCTCTGTCT TCACTGGTGT 350 

CGGTGAAAGA ACCCGTGAAG GTAACGATCT TTACCGTGAA ATGAAGGAAA 400 

CTGGTGTCAT CAACCTCGAA GGTGACTCCA AGGTCGCTCT CGTTTTCGGT 450 

CAAATGAACG AACCTCCAGG TGCCCGTGCC CGTGTCGCTT TGACTGGTCT 500 

TACCATTGCC GAATACTTCC GTGATGAAGA AGGACAAGAT GTCTTGCTTT 550 

TCGTTGACAA CATTTTCAGA TTCACCCAAG CCGGTTCTGA AGTCTCTGCT 600 

CTTTTGGGTC GTATTCCATC TGCCGTCGGT TACCAACCTA CCCTTGCTAC 650 

CGATATGGGT GCCCTCCAAG AACGTATTAC CACCACCCAA AAGGGTTCCG 700 

TCACATCTGT CCAAGCCGTC TATGTCCCAG CAGACGATTT GACTGATCCT 750 

GCCCCAGCCA CCACTTTCGC TCACTTGGAC GCCACCACTG TCTTGTCTCG 800 

TTCCATTTCC GAATTGGGTA TCTACCCAGC TGTCGATCCT CTCGATTCCA 850 

AGTCTCGTCT TTTGGATCCT GAAGTTATTG GACACGAACA CTACGAAGTT 900 

GCCACTCAAG TTCAACAAAC CCTCCAAGCT TACAAGTCTC TCCAAGATAT 950 

CATTGCCATT TTGGGTATGG ATGAATTGTC TGAAGCTGAT AAGCTTACTG 1000 

TCGAACGTGC CCGTAAGATC CAAAGATTCC TTTCCCAACC ATTCGCTGTT 1050 

GCCGAAGTTT TCACTGGTAT CGAAGGTCGT CTCGTTCCAT TGAAGGAAAC 1100 

CGTCAGATCT TTCAAGGAAA TCCTTGAAGG TAAGTACGAT CACCTTCCAG 1150 

AAGC 1154 



2) INFORMATION FOR SEQ ID NO: 460 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1295 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460 



CCAATTCGAC GAAGGAAACT TGCCAGCTAT TTTGAATGCT TTGACTTTGA 50 

AGAACGGTGA CCAAGACTTG GTCTTGGAAG TTGCTCAACA TTTGGGTGAA 100 

AACACCGTCA GAGCTATTGC TATGGATGGT ACTGAAGGTT TAGTCAGAGG 150 

TACCGAAGTC AACGATACCG GTGCCCCAAT CTCCGTCCCA GTCGGTAGAG 200 

GTACTTTAGG TAGAATCATC AATGTTGTTG GTGAACCAAT TGATGACAGA 250 

GGTCCAATTG AATGTAAGGA AAAGAAACCA ATTCACGCTG AACCACCATC 300 

ATTCGTTGAA CAATCCACTG CTGCCGARAT TTTGGAAACC GGTATCAAGG 350 

TTGTCGACTT GTTGGCCCCA TACGCCAGAG GTGGTAARAT TGGTTTATTC 400 

GGTGGTGCTG GTGTCGGTAA GACCGTCTTT ATCCAAGAAT TGATTAACAA 450 

CATTGCCAAA GCCCATGGTG GTTTCTCTGT CTTTACCGGT GTYGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAAGA AACCGGTGTC 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCT TTGGTCTTCG GTCAAATGAA 600 

CGAACCACCA GGTGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

CTGAATACTT CAGAGATGAA GAAGGTCAAG ATGTCTTGTT GTTCATTGAT 700 

AACATTTTCA GATTCACCCA AGCTGGTTCC GAAGTGTCTG CTTTGTTAGG 750 

TCGTATTCCA TCTGCTGTCG GTTATCAACC AACCTTAGCC ACTGATATGG 800 

GTCTTTTGCA AGAACGTATT ACCACCACCA AGAAAGGTTC CGTCACCTCT 850 

GTCCAAGCTG TCTATGTCCC AGCTGATGAT TTGACCGATC CTGCTCCAGC 900 

CACTACATTC GCCCATTTGG ATGCCACTAC TGTCTTGTCT AGAGGTATTT 950 

CTGAATTGGG TATCTACCCA GCTGTCGATC CATTGGATTC CAAATCCAGA 1000 

TTATTGGACG CTTCTGTTGT TGGTCAAGAA CATTACGATG TCGCTACTGG 1050 

TGTTCAACAA ACTTTACAAG CTTACAAATC CTTACAAGAT ATCATTGCTA 1100 

TTTTGGGTAT GGATGAATTG TCTGAAGCTG ATAAATTGAC TGTCGAAAGA 1150 

GCCCGTAAGA TCCAAAGATT CTTGTCTCAA CCATTCGCTG TTGCTGAAGT 1200 

TTTCACTGGT ATCCCAGGTA GATTAGTCAG ATTGCAAGAC ACTGTCAAAT 1250 

CATTCAAGGA TGTTTTGGAA GGTAAATACG ATAACTTGCC AGAAA 1295 



2) INFORMATION FOR SEQ ID NO: 461 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3949 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461 

TAACTTGCCA GCTATTTTGA ATGCTTTGAC TTTGAAGAAC GGTGACCAAG 50 

ATTTAGTTTT GGAAGTTGCT CAACATTTGG GTGAAAACAC CGTCAGAGCT 100 

ATTGCTATGG ATGGTACTGA AGGTTTAGTC AGAGGTACTG AAGTCAACGA 150 

TACCGGTGCC CCAATCTCCG TTCCAGTCGG TAGAGGTACC TTAGGTAGAA 200 

TCATCAATGT TGTTGGTGAA CCAATTGATG ACAGAGGTCC AATTGAATGT 250 

AAGGAAAAGA AACCAATTCA TGCAGAACCA CCATCCTTCG TTGAGCAATC 300 

CACTGCTGCC GAAATTTTGG AAACCGGTAT CAAGGTTGTC GACTTATTGG 350 

CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG TGCTGGTGTC 400 

GGTAAGACCG TCTTTATCCA AGAATTGATT AACAACATTG CTAAAGCCCA 450 

TGGTGGTTTC TCCGTCTTTA CCGGTGTCGG TGAAAGAACC AGAGAAGGTA 500 

ACGATTTGTA CCGTGAAATG AAAGAAACCG GTGTCATCAA CTTAGAAGGT 550 

GACTCCAAGG TCGCTTTGGT CTTTGGACAA ATGAACGAAC CACCAGGTGC 600 

TAGAGCTAGA GTTGCTTTGA CTGGTTTGAC TATTGCTGAA TACTTCAGAG 650 

ATGAAGAAGG TCAAGATGTC TTGTTGTTCA TCGATAACAT TTTCAGATTC 700 

ACCCAAGCTG GTTCCGAAGT GTCTGCTTTG TTAGGTCGTA TTCCATCTGC 750 

CGTCGGTTAT CAACCAACCT TAGCTACTGA TATGGGTCTT TTGCAAGAAC 800 

GTATTACCAC CACCAAGAAA GGTTCCGTCA CCTCTGTCCA AGCTGTCTAT 850 

GTCCCAGCTG ATGATTTGAC CGATCCTGCT CCAGCCACCA CATTCGCCCA 900 

TTTGGATGCC ACTACTGTCT TGTCTAGAGG TATTTCTGAA TTGGGTATTT 950 

ACCCAGCTGT CGATCCATTG GATTCCAAAT CCAGATTATT GGACGCTGCC 1000 

GTTGTTGGTC AAGAACATTA TGATGTCGCT ACTGGTGTTC AACAAACT^f 1050 

GCAAGCTTAC AAATCCTTAC AAGATATCAT TGCTATTTTG GGTATGGATG 1100 

AATTGTCTGA AGCTGATAAA TTGACTGTCG AAAGAGCCCG TAAGATTCAA 1150 

AGATTCTTGT CTCAACCATT CGCCGTTGCT GAAGTTTTCA CTGGTATTCC 1200 

AGGTAGATTA GTCAGATTGC AAGACACTGT CAAATCATTC AAGGATGTTT 1250 

TGGAAGGTAA ATACGATCAC TTGCCAG 1277 



2) INFORMATION FOR SEQ ID NO: 462 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1278 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 62894 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462 



GTAACTTGCC AGCTATTTTG AACGCTTTGA CCTTGAAGAA CGGTGAAAAC 50 

GACTTAGTTT TAGAAGTTGC CCAACATTTG GGTGAAAACA CCGTCAGAGC 100 

TATTGCTATG GATGGTACTG AAGGTTTAGT TAGAGGTACT CCAGTTACCG 150 

ATTCTGGTGC TCCAATTTCT GTCCCAGTCG GTAGAGGTAC TTTAGGTAGA 200 

ATCTTGAACG TTATTGGTGA ACCAATTGAT GAACAAGGTC CAGTTGATGC 250 

CAAGGAAACC AGACCAATTC ACCAAGACCC ACCAGCATTC GTTGATCAAT 300 

CCACCAAGGC TGAAGTTTTG GAAACTGGTA TCAAGGTTGT CGATTTATTA 350 

GCCCCTTACG CTAGAGGTGG TAAGATTGGT TTATTCGGTG GTGCCGGTGT 400 

CGGTAAGACC GTCTTTATCC AAGAATTGAT TAACAACATT GCCAAGGCCC 450 

ATGGTGGTTT CTCTGTTTTC ACTGGTGTCG GTGAAAGAAC CAGAGAAGGT 500 

AACGATTTAT ATAGAGAAAT GAAGGAAACT GGTGTCATTA ACTTGGAAGG 550 

TGACTCCAAG GTCGCCTTGG TTTTCGGTCA AATGAACGAA CCACCAGGTG 600 

CTAGAGCTAG AGTTGCTTTA ACCGGTTTAA CCATTGCCGA ATACTTCAGA 650 

GACGAAGAAG GTCAAGATGT GTTATTGTTC GTCGATAACA TTTTTAGATT 700 

CACCCAAGCC GGTTCCGAAG TGTCTGCTTT GTTAGGTCGT ATTCCATCGG 750 

CTGTCGGTTA TCAACCAACC TTAGCCACTG ATATGGGTCT TTTACAAGAA 800 

AGAATTACCA CCACCACCAA GGGTTCCGTT ACTTCTGTCC AAGCTGTCTA 850 

CGTCCCAGCC GATGATTTAA CCGATCCTGC TCCAGCTACC ACTTTCGCCC 900 

ACTTGGATGC TACCACTGTG TTGTCTCGTG GTATCTCTGA ATTGGGTATT 950 

TACCCAGCTG TCGATCCATT GGATTCCAAA TCCAGATTGT TAGATGCTGA 1000 

TATCGTTGGT AAAGAACACT ACGAAGTTGC CACTGGTGTC CAACAAACCT 1050 

TACAAGCTTA CAAATCTTTA CAAGATATCA TTGCTATTTT AGGTATGGAT 1100 

GAATTGTCTG AAGCCGATAA ATTGACTGTC GAAAGAGCCA GAAAGATCCA 1150 

AAGATTCTTG TCTCAACCAT TCGCCGTTGC CGAAGTTTTC ACCGGTATCC 1200 

CAGGTAGATT AGTTAGATTG CAAGACACTG TTAAATCTTT CAAGGAAGTC 1250 

TTAGAAGGTA AATATGATCA CTTACCAG 1278 



2) INFORMATION FOR SEQ ID NO: 463 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1154 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glabrata 

(B) STRAIN: ATCC 66032 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463 

TCTGGTCAGA GGCGAGAAGG TCGTCGACAC AGGTGCCCCA ATCTCCGTCC 50 

CTGTCGGCAG AGAGACCCTG GGCAGAATCA TCAACGTTAT CGGTGAACCT 100 

ATCGACGAGC GTGGCCCAAT CAACTCAAAG TTGAGAAAGC CTATCCACGC 150 

CGACCCTCCT TCCTTCGCAG AACAGTCCAC CGCCGCCGAA GTCCTGGAAA 200 

CAGGTATCAA GGTCGTCGAC TTGCTGGCCC CTTACGCCAG AGGTGGTAAG 250 

ATCGGTCTGT TCGGTGGTGC CGGTGTCGGT AAGACCGTGT TCATCCAAGA 300 

ACTGATCAAC AACATCGCAA AGGCTCACGG TGGTTTCTCC GTGTTCACAG 350 

GTGTCGGTGA AAGAACCAGA GAAGGTAACG ATTTGTACAG AGAAATGAAG 400 

GAAACCGGTG TCATCAACTT GGAAGGTGAC TCTAAGGTCG CCTTGGTCTT 450 

CGGCCAAATG AACGAACCAC CAGGAGCCAG AGCCAGAGTC GCCTTGACCG 500 

GTTTGACCAT CGCAGAATAC TTCAGAGATG AAGAAGGTCA AGATGTCCTG 550 

CTGTTCGTCG ACAACATTTT CAGATTCACC CAAGCCGGTT CAGAAGTCTC 600 

CGCTTTGCTA GGTCGTATCC CATCCGCCGT CGGTTATCAA CCAACCTTGG 650 

CCACCGATAT GGGTCTGTTG CAAGAAAGAA TTACCACCAC AAAGAAGGGT 700 

TCCGTCACTT CCGTCCAAGC CGTCTACGTG CCTGCAGATG ATTTAACAGA 750 

TCCTGCCCCT GCCACTACTT TCGCGCACTT GGACGCCACC ACCGTCTTGT 800 

CCAGAAGTAT CTCAGAATTG GGTATCTACC CAGCTGTCGA CCCATTGGAC 850 

TCCAAGTCTA GATTGCTAGA CGCTGCCGTT GTCGGTGAAG AGCATTACAA 900 

CGTCGCCACA AAGGTCCAAG AAACTTTACA AACTTACAAG TCTCTGCAAG 950 

ATATCATCGC CATTTTGGGT ATGGATGAAT TGTCCGAACA AGATAAGCTA 1000 

ACTGTCGAAA GAGCAAGAAA GATCCAAAGA TTCTTGTCCC AACCATTCGC 1050 

TGTCGCTGAA GTTTTCACCG GTATCGAAGG TAAGCTGGTC AGATTGAAGG 1100 

ACACCATCTC CTCTTTCAAG GCTGTCTTGG AAGGTAAGTA CGATGATCTT 1150 

CCAG 1154 



2) INFORMATION FOR SEQ ID NO: 464 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1293 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 
(B) STRAIN: ATCC 6260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464 



CCACTACGAG GACGGTAACC TTCCTGCTAT TTTCAACGCC TTGACTCTTA 50 

AGAACGGTGA CCAAAACTTG GTTTTGGAAG TTGCCCAGCA TTTGGGTGAA 100 

AACACCGTCA GAACCATTGC TATGGATGGT ACTGAAGGTT TGGTTAGAGG 150 

TGCCAGCGTC ACTGACACTG GTGCTCCTAT CTCTGTGCCT GTTGGTCGTG 200 

GTACTTTGGG TCGTATCATC AACGTTATTG GTGAGCCAAT TGACGAGCGT 250 

GGACCAATCG AGTCCAAGCA AAAGAAGCCC ATTCACGCTG AACCACCATC 300 

GTTCGTCGAA CAATCCACTT CTGCCGAGGT TTTGGAAACC GGTATCAAGG 350 

TTGTCGACTT GTTGGCTCCA TACGCCAGAG GTGGTAAGAT TGGATTGTTff" 400 

GGTGGTGCCG GTGTCGGTAA GACTGTGTTT ATCCAAGAGT TGATTAACAA 450 

CATTGCCAAG GCTCACGGTG GTTTCTCCGT GTTCACCGGT GTCGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACTGGTGTC 550 

5 ATCAACTTGG AAGGTGAATC CAAGGTGGCC TTGGTGTTCG GTCAAATGAA 600 

CGAACCTCCA GGAGCTAGAG CCAGAGTTGC CCTTACCGGT TTGACCATCG 650 

CTGAATACTT CAGAGATGAG GAGGGTCAAG ATGTGTTGTT GTTCGTCGAC 700 

AACATTTTCA GATTCACTCA AGCTGGTTCT GAAGTGTCGG CTTTGTTGGG 750 

TCGTATTCCT TCGGCTGTCG GTTACCAACC TACTTTGGCC ACCGATATGG 800 

10 GTTTGTTGCA AGAGCGTATT ACCACCACCA AGAAGGGTTC CGTCACCTCT 850 

GTCCAAGCTG TCTATGTGCC AGCCGATGAT TTGACCGATC CTGCTCCTGC 900 

TACTACTTTT GCTCACTTGG ATGCTACCAC TGTGTTGTCT AGAGGTATCT 950 

CCGAGTTGGG TATTTACCCA GCTGTCGATC CTTTGGATTC CAAGTCGAGA 1000 

TTGTTGGATG CCTCTGTTGT CGGTGAGGAG CACTACTCGG TTGCTTCTAA 1050 

15 CGTTCAACAA ACCTTGCAAG CTTACAAGTC, TTTGCAAGAT ATCATTGCCA 1100 

TTTTGGGTAT GGACGAATTG TCGGAAGCTG ACAAGTTGAC CGTCGAGAGA 1150 

GCCCGTAAGA TCGAGAGATT CTTGTCTCAA CCATTTGCTG TTGCCGAAGT 1200 

TTTCACTGGT ATCAGTGGTA AGTTGGTCAG ATTGGAGGAC ACTATCAGAT 1250 

CTTTCAAGGA AGTCTTGGAA GGTAAGTACG ATCACTTGCC AGA 1293 

20 



2) INFORMATION FOR SEQ ID NO: 465 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1293 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Candida haemulonii

(B) STRAIN: ATCC 22991

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465 

CACTTTGACG ATGGTAACTT GCCAGCCATT TTCAACGCCT TGAAGTTGAA 50

GAACGGTGAC CAGGACTTGG TCTTGGAGGT CGCCCAGCAC TTGGGTGAGA 100

ACACCGTCAG AACCATTGCC ATGGACGGTA CCGATGGTTT GGTCAGAGGC 150

GAGGCTGTCA CTGACACTGG TGCTCCAATC TCCGTGCCTG TTGGTCGTGA 200

GACTTTGGGT CGTATCATCA ACGTTATTGG TGAGCCAATT GACGAGAGAG 250

GACCAATCAA GTCCAAGAAG AGAAACCCAA TCCACACTGA CCCACCAACC 300

TTCGTTGAGC AGTCTACTTC TGCTGAGGTT TTGGAGACTG GTATTAAGGT 350

TGTCGACTTG TTGGCCCCTT ACGCCAGAGG TGGTAAGATT GGTTTGTTCG 400

GTGGTGCCGG TGTCGGTAAG ACCGTCTTTA TCCAAGAGTT GATTAACAAC 450

ATTGCCAAGG CCCACGGTGG TTTCTCTGTC TTTACCGGTG TCGGTGAGAG 500

AACCAGAGAA GGTAACGATT TGTACCGTGA AATGCAGGAG ACCGGTGTCA 550

TCAACTTCGA GGGTGACTCC AAGGTCGCCT TGGTCTTCGG TCAGATGAAC 600

GAGCCACCAG GAGCTAGAGC TAGAGTTGCT TTGACCGGTT TGACCATTGC 650

CGAATACTTC AGAGATGAAG AAGGTCAGGA TGTGTTGTTG TTCGTTGACA 700

ACATTTTCAG ATTCACTCAG GCTGGTTCCG AGGTGTCCGC CTTGTTGGGT 750

CGTATTCCAT CTGCTGTCGG TTACCAGCCA ACCTTGGCCA CCGATATGGG 800

TACCTTGCAA GAAAGAATTA CCACCACCAA GAAGGGTTCC GTCACCTCTG 850

TCCAGGCCGT TTACGTGCCA GCTGATGATT TGACCGATCC TGCCCCAGCT 900

ACCACTTTCG CTCACTTGGA TGCTACCACT GTGTTGTCTA GAGGTATTTC 950

CGAGTTGGGT ATCTACCCAG CTGTCGACCC ATTGGACTCC AAGTCTAGAT 1000

TGTTGGATGC CTCTGTTGTC GGTAAGGAGC ACTACGATGT TGCTACCAAC 1050

GTCCAGCAGA CCTTGCAGGC CTACAAGTCC TTGCAGGATA TCATTGCCAT 1100

TTTGGGTATG GATGAATTGT CCGAAGCCGA CAAGTTGACT GTCGAGAGAG 1150

CTAGAAAGAT TCAGAGATTC TTGTCCCAGC CATTCGCCGT CGCCGAGGTT 1200

TTCACTGGTA TCGAGGGTAA GTTGGTCAGA TTGGAGGAGA CCGTCAAGTC 1250

TTTCAAGGAG GTCTTGGATG GTAAGTACGA CCACTTGCCA GAG 1293



2) INFORMATION FOR SEQ ID NO: 466 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1111 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida inconspicua 

(B) STRAIN: ATCC 16783 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466 



AACGGTGGAT CTAAGTTAGT CTTAGAAGTT GCTCAACATT TGGGTGAAAA 50

CACTGTCAGA ACCATTGCTA TGGATGGTAC TGAAGGTTTA GTTAGAGGTC 100

AACCAGTTAA TGATACTGGT GCTCCAATCT CTGTCCCAGT CGGTAGAGGT 150

ACTTTAGGTA GAATCTTAAA CGTTATTGGT GATCCAGTCG ATGAAAGAGG 200

TCCAATTGAT TGTAAGGAAA GAAAACCAAT TCATCAAGAT CCTCCTGCTT 250

TCGTTGAACA ATCTACTGAA GCTGAAGTTT TAGAAACCGG TATTAAGGTT 300

GTTGATTTAT TAGCTCCTTA CGCTAGAGGT GGTAAGATTG GTTTATTCGG 350

TGGTGCTGGT GTCGGTAAAA CCGTTTTCAT TCAAGAATTA ATTAACAATG 400

TTGCAAAGGC TCATGGTGGT TTCTCAGTTT TCACTGGTGT CGGTGAAAGA 450

ACCAGAGAAG GTAATGATTT ATACAGAGAA ATGAAGGAAA CTGGTGTTAT 500

TAACTTAGAA GGTGAATCTA AGGTCGCCTT AGTTTTCGGT CAAATGAATG 550

AACCACCAGG AGCAAGAGCA AGAGTTGCTT TAACCGGTTT AACTATTGCT 600

GAATACTTCA GAGATGAAGA AGGTCAAGAT GTCTTGTTAT TCATTGATAA 650

CATTTTCAGA TTTACTCAAG CAGGTTCTGA AGTTTCTGCA TTGTTAGGTA 700

GAATTCCATC CGCTGTCGGT TATCAACCAA CTTTAGCAAC TGATATGGGT 750

CTTTTACAAG AAAGAATTAC AACTACTAAG AAAGGTTCCG TTACTTCTGT 800

CCAAGCAGTT TATGTCCCAG CAGATGATTT AACTGATCCT GCTCCAGCAA 850

CTACTTTCGC CCACTTAGAT GCAACTACTG TCTTATCTAG AGGTATTTCC 900

GAATTAGGTA TTTACCCAGC TGTCGATCCA TTAGATTCTA AATCTAGATT 950

ATTAGATGTT GCTGTTGTTG GTCAAGAACA TTATGATGTT GCAACTCAAG 1000

TTCAAGAAAC TTTACAAGCA TACAAATCTT TACAAGATAT TATTGCTATT 1050

TTAGGTATGG ATGAATTATC TGAACAAGAT AAATTAACCG TTGAAAGAGC 1100

AAGAAAGATT C 1111



2) INFORMATION FOR SEQ ID NO: 467

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1283 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Candida kefyr

(B) STRAIN: ATCC 28838

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467

TCGAACAAGG TCAATTGCCA GAAATTTTGA ACGCTTTGGA GATTGAWACT 50 

CCTCAAGGTA AGTTGGTTTT GGAAGTTGCC CAACATTTGG GTGAAAACAC 100 

CGTCAGAACC ATTGCTATGG ACGGTACCGA AGGTTTGGTC CGTGGTGAGA 150 

AGGTTTTGGA CACTGGTGCT CCAATTTCCG TCCCAGTCGG TAGAGAAACT 200 

TTGGGTAGAA TCATCAACGT TATTGGTGAG CCAATTGACG AAAGAGGCCC 250

AATCAAGTCC AAGATGAGAA AGCCAATTCA CGCTGACCCT CCATCCTTTG 300 

TTGAACAATC CACTGCTGCT GAAGTTTTGG AAACCGGTAT CAAGGTTGTC 350 

GACTTGTTGG CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG 400 

TGCCGGTGTC GGTAAGACCG TTTTCATCCA AGAGTTGATT AACAACATTG 450 

CCAAGGCCCA TGGTGGTTTC TCCGTCTTCA CCGGTGTCGG TGAAAGAACC 500

AGAGAAGGTA ACGATTTGTA CCGTGAAATG AAGGAAACCG GTGTCATCAA 550 

CTTGGAAGGT GACTCCAAGG TCGCCTTGGT CTTCGGTCAA ATGAACGAAC 600 

CACCTGGAGC TAGAGCCAGA GTTGCCTTGA CCGGTTTGAC TATCGCTGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTG TTGTTGTTTA TCGACAACAT 700 

TTTCAGATTC ACACAAGCCG GTTCCGAAGT GTCCGCTTTG TTGGGTCGTA 750

TTCCATCCGC TGTCGGTTAC CAACCTACTT TGGCCACCGA TATGGGTTTG 800 

TTGCAAGAAA GAATTACCAC TACCAAGAAG GGTTCCGTTA CCTCCGTCCA 850

AGCTGTCTAC GTCCCTGCTG ATGATTTGAC TGATCCTGCT CCAGCTACYA 900 

CTTTCGCCCA TTTGGACGCC ACCACCGTGT TGTCCAGAGG TATCTCCGAA 950 

TTGGGTATCT ACCCAGCTGT CGATCCATTG GATTCCAAGT CTAGATTGTT 1000

GGACGCTGCC GTTGTCGGTC AAGAACATTA CGACGTCGCT ACTCAAGTTC 1050 

AACAAACTTT GCAAGCTTAC AAGTCTTTGC AAGATATCAT TGCCATTTTG 1100 

GGTATGGATG AATTGTCTGA ACAAGACAAG TTGACTGTCG AAAGAGCCAG 1150 

AAAGATCCAA AGATTCTTGT CTCAACCATT CGCCGTCGCC GAAGTTTTCA 1200

CTGGTATCCC AGGTAGATTG GTCAGATTAA AGGACACCAT CGCTTCCTTC 1250

AAGGCTGTTT TGGAAGGTAA GTACGATCAC TTG 1283 



2) INFORMATION FOR SEQ ID NO: 468

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1287 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Candida krusei

(B) STRAIN: ATCC 34135

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468

CTTCGAACAA GGCCAATTAC CACAAATTTT AAACGCTTTA GTTATGGATA 50

ATGGTGGTAA CAAGTTAGTT TTAGAAGTTG CTCAACATTT AGGTGAAAAC 100

ACTGTCAGAA CCATTGCTAT GGATGGTACT GAAGGTTTAG TTAGAGGTCA 150

AACCGTTAAC GATACCGGTG CTCCAATCTC TGTCCCAGTT GGTAGAGGTA 200

CCTTAGGTAG AATCTTGAAC GTCATTGGTG ATCCAGTCGA TGAAAGAGGT 250

CCAGTTGACT GTAAGGAAAG AAAGCCAATT CACGCTGATC CTCCAGCTTT 300

CGTTGAACAA TCCACTGAAG CTGAAGTTTT GGAAACTGGT ATTAAGGTTG 350

TCGATTTATT AGCACCTTAC GCAAGAGGTG GTAAGATTGG TTTATTCGGT 400

GGTGCTGGTG TTGGTAAGAC CGTTTTTATC CAAGAATTGA TCAACAATGT 450

YGCAAAGGCT CATGGTGGTT TCTCCGTTTT CACTGGTGTT GGTGAAAGAA 500

CCAGAGAAGG TAACGATTTA TACAGAGAAA TGAAGGAAAC TGGTGTTATT 550

AACTTGGAAG GTGAATCCAA GGTCGCCTTA GTTTTCGGTC AAATGAACGA 600

ACCACCAGGA GCTAGAGCAA GAGTTGCTTT AACTGGTTTG ACCATTGCAG 650

AATATTTCAG AGATGAAGAA GGTCAAGATG TCTTGTTATT CATTGATAAC 700

ATTTTCAGAT TCACCCAAGC AGGTTCTGAA GTCTCTGCAT TATTAGGTAG 750

AATTCCATCT GCTGTCGGTT ATCAACCAAC TTTAGCAACC GATATGGGTC 800

TTTTACAAGA AAGAATTACC ACCACCAAGA AGGGTTCCGT TACTTCTGTC 850

CAAGCTGTTT ATGTCCCAGC CGATGATTTA ACCGATCCTG CTCCAGCTAC 900

TACTTTCGCC CACTTGGATG CAACCACTGT CTTGTCYAGA GGTATTTCCG 950

AATTAGGTAT CTACCCAGCT GTCGATCCAT TAGATTCTAA GTCTAGATTA 1000

TTAGATGTTG CAGTTGTTGG TCAAGAACAT TATGAAGTTG CAACTCAAGT 1050

CCAAGAAACT TTACAAGCTT ACAAGTCTTT ACAAGATATT ATTGCTATTT 1100

TGGGTATGGA TGAATTATCT GAACAAGATA AGTTAACYGT TGAAAGAGCA 1150

AGAAAGATCC AAAGATTCTT ATCTCAACCA TTCTCTGTTG CAGAAGTTTT 1200

CACTGGTATT CCAGGTAAGT TAGTCAGATT AGAAGAAACC ATCAAGTCTT 1250

TCAGGGATGT TCTTGCAGGT AAGTACGATC ACTTACC 1287



2) INFORMATION FOR SEQ ID NO: 469

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1283 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Candida lambica

(B) STRAIN: ATCC 24750

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469

TCGAACAAGG TCAATTACCA CCAATTCTTA ACGCTTTGGT CATGGAAAAC 50 

GATGGTCAAA AGTTAGTTTT GGAAGTTGCT CAACATTTGG GTGAAAACAC 100 

CGTCAGAACC ATTGCTATGG ACGGTACCGA AGGTTTAGTT AGAGGTCAAC 150 

CTGTTAACGA CACTGGTGCT CCAATCTCTG TCCCAGTTGG TAGAGGTACT 200 

CTTGGTAGAA TCTTGAACGT CACTGGTGAC CCAGTYGATG AAAGAGGTCC 250 

TGTCGAMTGT AAGGAGAGAA GACCAATTCA CCAAGACCCA CCTGCTTTCG 300 

TTGACCAATC CACTGAAGCT GAAGTTTTGG AAACCGGTAT TAAGGTTGTC 350 

GATTTATTAG CACCTTACGC TAGAGGTGGT AAGATTGGTT TGTTCGGTGG 400

TGCTGGTGTC GGTAAGACCG TTTTCATCCA AGAATTGATT AACAATGTTG 450

CTAAGGCCCA CGGTGGTTTC TCCGTTTTCA CTGGTGTCGG TGAAAGAACC 500 

AGAGAAGGTA ACGATTTATA CAGAGAAATG AAGGAAACCG GTGTTATTAA 550 

CCTTGAAGGT GAATCTAAGG TCGCCCTCGT TTTCGGTCAA ATGAACGAAC 600 

CACCAGGAGC AAGAGCTAGA GTTGCTCTTA CTGGTTTGAC CATTGCTGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTC TTGCTTTTCA TTGACAACAT 700 

TTTCAGATTC ACCCAAGCAG GTTCCGAAGT GTCTGCTTTG TTAGGTAGAA 750 

TTCCWTCTGC CGTTGGTTAC CAACCAACCT TAGCAACCGA TATGGGTTTG 800 

TTACAAGAAA GAATTACCAC CACCAAGAAG GGTTCCGTCA CCTCCGTCCA 850 

AGCTGTTTAC GTCCCAGCTG ATGATTTRAC TGACCCTGCA CCAGCAACCA 900 

CTTTCGCCCA CTTGGATGCT ACCACCGTCT TGTCTAGAGG TATTTCCGAA 950 

TTAGGTATTT ACCCAGCTGT CGATCCATTA GATTCTAAGT CTAGATTATT 1000 

GGATGTCGCT GTTGTCGGTC AAGAACATTA TACCGTTGCA ACYCAAGTCC 1050 

AAGAAACTTT ACAAGCTTAC AAGTCCTTAC AAGATATCAT TGCTATTTTG 1100 

GGTATGGACG AATTATCTGA ACAAGATAAG CTTACTGTTG AAAGAGCAAG 1150 

AAAGATCCAA AGATTCCTTT CCCAACCTTT CTCCGTCGCA GAAGTTTTCA 1200 

CTGGTATCCC AGGTAAGCTT GTCAGATTAG AAGAAACCAT TATTTCTTTC 1250 

AGAGATGTCC TCGATGGTAA GTACGACCAC TTA 1283



2) INFORMATION FOR SEQ ID NO: 470 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1140 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lusitaniae 

(B) STRAIN: ATCC 66035 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470 

AGGTGCCTCT GTCACTGACA CTGGTTCTCC AATCTCTGTC CCTGTTGGTC 50 

GTGAAACCTT GGGTAGAATT ATCAACGTTG TTGGTGAGCC AATTGACGAG 100 

AGAGGCCCAA TCAACTCCAA GAAGAGAAAC CCAATTCACA CTGAGCCACC 150 

ATCGTTTGTT GAACAATCCA CTTCTGCTGA AGTTTTGGAG ACTGGTATCA 200

AGGTTGTCGA CTTGTTGGCC CCTTACGCCA GAGGTGGTAA GATTGGTTTG 250 

TTCGGTGGTG CCGGTGTCGG TAAGACCGTT TTCATCCAAG AGTTGATTAA 300 

CAACATTGCC AAGGCCCACG GTGGTTTCTC TGTTTTCACT GGTGTCGGTG 350

AAAGAACCAG AGAAGGTAAC GATTTGTACC GTGAAATGCA AGAGACCGGT 400

GTCATCAACT TCGAGGGTGA CTCCAAGGTC GCCTTGGTCT TCGGTCAAAT 450 

GAACGAACCA CCAGGAGCCA GAGCTAGAGT TGCTTTGACC GGTTTGACTA 500 

TTGCCGAGTA CTTCAGAGAC GAAGAGGGCC AAGATGTCTT GTTGTTCGTT 550 

GACAACATTT TCAGATTCAC CCAGGCCGGT TCTGAAGTGT CTGCTTTGTT 600 

GGGTCGTATT CCATCCGCTG TCGGTTACCA ACCAACCTTG GCCACCGATA 650 

TGGGTGCTTT GCAAGAGAGA ATTACCACCA CCAAGAAGGG TTCCGTCACC 700 

TCTGTCCAAG CCGTTTATGT TCCAGCTGAT GACTTGACTG ACCCTGCTCC 750 

AGCCACCACC TTCGCCCACT TGGACGCCAC CACTGTGTTG TCCAGAGGTA 800 

TCTCTGAATT GGGTATCTAC CCAGCTGTCG ACCCATTGGA CTCCAAGTCT 850 

AGATTGTTGG ACGCTTCTAT TGTTGGTAAG GAGCACTACG AAGTTGCTTC 900 

TAACGTTCAA CAAACTTTGC AAGCTTACAA GTCTTTGCAA GATATCATTG 950 

CCATTTTGGG TATGGATGAA TTGTCTGAGG CTGACAAGTT GACCGTTGAG 1000 

AGAGCCAGAA AGATCCAAAG ATTCTTGTCT CAACCATTCG CTGTTGCCGA 1050 

GGTTTTCACT GGTATCCCAG GTAGATTGGT CAGATTGGAG GACACTGTCA 1100 

GATCCTTCAA GGAAGTTTTG GACGGTAAGT ACGACCACTT 1140



2) INFORMATION FOR SEQ ID NO: 471 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1296 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegensis 

(B) STRAIN: ATCC 22977 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471

TCAATTCGAA CAAGGTCAAT TACCACCAAT TTTAAACGCT TTAGTCATGG 50

ACAACGGTGG AACTAAGTTA GTTTTAGAAG TTGCTCAACA TTTAGGTGAA 100

AACACCGTCA GAACCATTGC TATGGATGGT ACTGAAGGTT TAGTTAGAGG 150

TCAACCAGTT AATGATACTG GTGCTCCAAT CTCTGTCCCA GTCGGTAGAG 200

GTACTTTAGG TAGAATCTTA AACGTTATTG GTGATCCAGT CGATGAAAGA 250

GGTCCAATCG ATTGTAAGGA AAGAAAACCA ATTCATCAAG ATCCTCCTGC 300

TTTCGTTGAA CAATCAACTG AAGCTGAAGT TTTAGAAACT GGTATTAAAG 350

TTGTCGATTT ATTAGCTCCT TACGCTAGAG GTGGTAAGAT TGGTTTATTC 400

GGTGGTGCAG GTGTTGGTAA AACCGTTTTT ATTCAAGAAT TAATTAACAA 450

TGTTGCAAAA GCTCATGGTG GTTTCTCCGT TTTCACTGGT GTCGGTGAAA 500

GAACTAGAGA AGGTAACGAT TTATACAGAG AAATGAAGGA AACTGGTGTT 550

ATTAATTTAG AAGGTGATTC TAAGGTCGCA TTAGTTTTCG GTCAAATGAA 600

CGAACCACCT GGAGCAAGAG CAAGAGTTGC TTTAACTGGT TTAACTATTG 650

CTGAATATTT CAGAGATGAA GAAGGTCAAG ATGTCTTGTT ATTCATTGAT 700

AACATTTTCA GATTTACTCA AGCAGGTTCT GAAGTTTCCG CTTTGTTAGG 750

TAGAATTCCA TCCGCTGTCG GTTATCAACC AACTTTAGCA ACTGATATGG 800

GTCTTTTACA AGAAAGAATT ACTACTACTA AGAAAGGTTC TGTTACTTCC 850

GTTCAAGCAG TTTATGTCCC AGCAGATGAT TTAACTGATC CTGCTCCAGC 900

AACTACTTTC GCCCACTTAG ATGCAACTAC TGTCTTATCT AGAGGTATTT 950

CCGAATTAGG TATTTACCCA GCTGTCGATC CATTAGATTC TAAATCTAGA 1000

TTATTAGATG TTGCAGTTGT TGGTCAAGAA CATTATGATG TTGCAACTCA 1050

AGTTCAAGAA ACTTTACAAG CTTACAAATC TTTACAAGAT ATTATTGCTA 1100

TTTTAGGTAT GGATGAATTA TCTGAACAAG ATAAATTAAC CGTTGAAAGA 1150

GCTAGAAAAA TTCAAAGATT CTTATCTCAA CCTTTCTCTG TCGCAGAAGT 1200

TTTCACTGGT ATCCCAGGTA AGCTTGTTAG ATTAGAAGAA ACTATTTCTT 1250

CATTCAGAGA TGTCTTAGCA GGTAAGTACG ATCACTTACC AGAAAA 1296



2) INFORMATION FOR SEQ ID NO: 472 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1297 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 

(B) STRAIN: ATCC 90018 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472 



ACACTTCGAA GAAGGAAACT TGCCAGCTAT TTTGAACGCT TTGACCTTGA 50

AAAACGGTAA ACAAGACTTG GTCTTGGAAG TTGCCCAACA TTTGGGTGAA 100

AACACCGTCA GAGCAATTGC TATGGATGGT ACTGAAGGTT TAGTTAGAGG 150

TACTTCTGTC ACTGACACTG GTGCCCCAAT TTCTGTCCCA GTTGGTAGAG 200

GTACTTTGGG TAGAATCATC AATGTTACTG GTGACCCAAT TGATGAAAGA 250

GGTCCAATTG AATGTAAGAA GAGAAACCCA ATCCACGCTG AACCACCTTC 300

ATTCATTGAA CAATCCACTG CTGCTGAAGT TTTGGAAACC GGTATTAAGG 350

TTGTCGACTT GTTGGCTCCA TACGCTAGAG GTGGTAAGAT TGGTTTGTTC 400

GGTGGTGCCG GTGTCGGTAA AACCGTGTTT ATCCAAGAAT TGATCAACAA 450

CATTGCCAAG GCACATGGTG GTTTCTCGGT TTTCACTGGT GTTGGTGAAA 500

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACTGGTGTT 550

ATCAACTTGG AAGGTGACTC CAAGGTCGCT TTGGTTTTCG GTCAAATGAA 600

CGAACCACCT GGAGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650

CTGAATACTT TAGAGATGAA GAAGGTCAAG ATGTCTTGTT GTTTATTGAC 700

AACATTTTCA GATTCACCCA AGCTGGTTCA GAAGTGTCTG CTTTGTTGGG 750

TCGTATTCCA TCCGCTGTCG GTTATCAACC AACCTTGGCC ACCGATATGG 800

GTCTTTTGCA AGAACGTATT ACCACTACCA AAAAGGGTTC AGTTACTTCA 850

GTCCAAGCCG TTTACGTGCC AGCTGATGAT TTAACCGATC CTGCTCCAGC 900

TACCACTTTC GCTCACTTGG ATGCCACCAC TGTGTTGTCT AGAGGTATTT 950

CGGAGTTGGG TATTTACCCA GCTGTCGATC CATTGGATTC CAAATCCAGA 1000

TTGTTGGATG CTGCCGTTGT TGGTCAAGAA CATTACGACG TTGCCACTGG 1050

TGTTCAACAA ACTTTACAAG CTTACAAGTC CTTGCAAGAT ATCATTGCTA 1100

TCTTGGGTAT GGATGAATTG TCCGAACAAG ATAAATTGAC TGTCGAAAGA 1150

GCTCGTAAGA TTCAAAGATT CTTGTCTCAA CCATTTGCTG TCGCTGAAGT 1200

TTTCACTGGT ATTCCAGGTA AATTGGTTAG ATTGTCTGAA ACTGTCAAGT 1250

CATTCAAGGA AGTCTTGGAA GGTAAGTACG ATAACTTGCC AGAAAAT 1297



2) INFORMATION FOR SEQ ID NO: 473

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1285 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473 



CCAGTTTGGC AACGACCTCC CTGCCATTTT GAACGCCCTC ACTCTTCAGC 50 

GTGAAGACGG TAACAAGCTT GTTCTTGAGG TTGCCCAGCA TCTCGGTGAG 100 

AACACCGTCC GTACCATTGC TATGGACGGT ACCGAGGGTT TGGTGCGTGG 150

CACTGGTGTC CACGACACCG GACACCCCAT CATGACTCCC GTCGGTGACG 200

GTACCCTGGG ACGTATTCTT AACGTCACCG GTGACCCTGT AGACGAGCGT 250 

GGTCCCGTCA AGACTGACAA GTTCCGCCCC ATCCACGCCG AGGCCCCTGC 300 

CTTCGATGAG CAGGCTACCA GTGCCGAGGT TCTTGAGACC GGTATCAAGG 350 

TTGTCGACTT GCTCGCTCCT TACGCCAAGG GTGGTAAGAT TGGTTTGTTC 400

GGTGGTGCCG GTGTCGGCAA GACCGTCTTC ATCCAGGAGC TGATTAACAA 450

CATCGCCAAG GCCCACGGTG GTTACTCCGT GTTCACTGGT GTCGGTGAGC 500 

GTACTCGTGA GGGTAACGAT TTGTACAAGG AAATGATCGA GTCCGGTGTC 550 

ATCAACCTCG ATGGTGAGTC CAAGGTCGCC TTGGTGTTCG GTCAGATGAA 600 

CGAGCCCCCT GGAGCCCGTG CCCGTGTCGC CCTTACCGGT CTTACCATCG 650

CTGAGTATTT CCGTGATGAG GAGGGTAAGG ATGTCTTGTT GTTCGTTGAC 700 

AACATTTTCC GCTTCACTCA GGCCGGTTCT GAGGTGTCCG CCTTGCTTGG 750 

TCGTATTCCT TCCGCTGTCG GTTACCAGCC TACCCTGGCC ACCGATATGG 800 

GTGCCCTTCA GGAGCGTATC ACCACCACCA AGAAGGGTTC CGTTACATCC 850 

GTCCAGGCCG TCTACGTCCC TGCCGATGAT TTGACTGATC CCGCCCCTGC 900

CACCACCTTC GCCCATTTGG ATGCCACCAC TGTCTTGTCT CGTGCCATCT 950 

CTGAGTTGGG TATCTACCCC GCTGTCGACC CTCTCGACTC CAAGTCCCGT 1000 

CTTCTTGACG CCGCTGTCGT TGGTCAGGAG CACTACGATA CTGCCACCTC 1050 

CGTTCAGCAG ACTTTGCAGG CTTACAAGTC TTTGCAGGAT ATCATTGCCA 1100 

TTCTTGGTAT GGATGAGTTG TCCGAGTCTG ACAAGCTCAC CGTCGAGCGT 1150

GCTCGTAAGA TCCAGCGTTT CCTCTCCCAG CCTTTCGCTG TTGCTGAGGT 1200

CTTCACTGGT ATTCAGGGCC GTCTTGTTCC TCTCAAGGAC ACTGTCCGCT 1250 

CCTTCAAGGA GATTCTCGAA GGTAAGTACG ATGCT 1285



2) INFORMATION FOR SEQ ID NO: 474

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Candida sphaerica

(B) STRAIN: ATCC 2504

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474



TTGAACAAGG TCAATTGCCA GCTATTTTGA ACGCTTTGGA AATCGACACT 50

CCAGAAGGAA AGTTGGTTTT GGAAGTCGCT CAACATTTGG GTGAAAACAC 100

TGTCAGAACC ATTGCTATGG ATGGTACTGA AGGTTTAGTC CGTGGTGAAA 150

ACGTTTCTGA CACTGGTGCT CCAATTTCCG TCCCAGTTGG TAGAGAAACC 200

TTGGGTAGAA TTATCAACGT TATTGGTGAG CCAATTGACG AAAGAGGTCC 250

AATCAACTCC AAGATGAGAA AGCCAATTCA TGCTGATCCT CCATTATTCG 300

TTGAACAATC CACTGGTGCT GAAGTTTTGG AAACTGGTAT CAAGGTTGTC 350

GACTTGTTGG CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG 400

TGCCGGTGTC GGTAAGACCG TTTTCATCCA AGAATTGATT AACAACATTG 450

CCAAGGCTCA TGGTGGTTTC TCTGTCTTCA CTGGTGTCGG TGAAAGAACC 500

AGAGAAGGTA ACGATTTGTA CCGTGAAATG AAGGAAACTG GTGTTATCAA 550

CTTGGAAGGT GATTCTAAGG TCGCGTTGGT TTTCGGTCAA ATGAACGAAC 600

CTCCTGGAGC TAGAGCTAGA GTCGCCTTGA CTGGTTTGAC CATCGCTGAA 650

TACTTCAGAG ATGAAGAAGG TCAAGATGTC TTGTTGTTTA TCGACAACAT 700

TTTCAGATTC ACACAAGCCG GTTCCGAAGT GTCCGCTTTG TTGGGTCGTA 750

TTCCATCCGC TGTCGGTTAT CAACCAACTT TGGCCACCGA TATGGGTTTG 800

TTGCAAGAAA GAATTACTAC CACCAAGAAG GGTTCCGTCA CTTCTGTCCA 850

AGCCGTTTAC GTGCCAGCTG ATGATTTGAC TGATCCTGCT CCAGCTACCA 900

CTTTCGCGCA TTTGGATGCC ACCACTGTGT TGTCCAGAGG TATCTCTGAA 950

TTGGGTATCT ACCCAGCTGT CGATCCTTTG GATTCCAAAT CTAGATTGTT 1000

GGATGCTGCC GTCGTTGGTC AAGAACATTA CGATGTCGCT ACTCAAGTTC 1050

AACAAACTTT GCAAGCTTAC AAGTCTTTGC AAGATATCAT TGCCATTTTG 1100

GGTATGGATG AATTGTCCGA ACAAGATAAG TTGACCGTCG AAAGAGCTAG 1150

AAAGATTCAA AGATTCTTGT CTCAACCTTT CGCTGTCGCT GAAGTCTTCA 1200

CTGGTATCCC AGGTAGATTG GTCAGATTAA AGGACACCAT CTCTTCTTTC 1250

AAGGCTGTCT TGGACGGTAA GTACGATCAC TTG 1283



2) INFORMATION FOR SEQ ID NO: 475 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1290 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Candida tropicalis

(B) STRAIN: ATCC 13803

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475

CGACGAAGGT AACTTGCCAG CTATTTTGAA TGCTTTGACT TTGAAGAACG 50

GTGACCAAGA CTTGGTTTTG GAAGTTGCTC AACATTTGGG TGAAAACACC 100 

GTCAGGGCTA TTGCTATGGA TGGTACCGAA GGTTTAGTCA GAGGTACTGC 150

TGTCACTGAT ACCGGTGCTC CAATTTCTGT CCCAGTTGGT AGAGGTACCT 200 

TGGGTAGAAT TATCAACGTT GTTGGTGAAC CAATTGATGA CAGAGGTCCA 250 

ATTGAATGTA AGGAAAGAAA GCCAATTCAC GCTGAACCAC CTTCATTCGT 300 

TGAACAATCT ACTGCTGCCG AAATTTTGGA AACCGGTATT AAGGTTGTCG 350 

ATTTGTTGGC TCCTTACGCT AGAGGTGGTA AGATTGGTTT GTTCGGTGGT 400

GCTGGTGTCG GTAAAACCGT CTTCATCCAA GAATTGATTA ACAACATTGC 450 

CAAAGCTCAT GGTGGTTTCT CTGTCTTTAC CGGTGTCGGT GAAAGAACCA 500 

GAGAAGGTAA CGATTTGTAC CGTGAAATGA AAGAAACTGG TGTCATCAAC 550 

TTGGAAGGTG ACTCCAAGGT CGCCTTGGTC TTCGGTCAAA TGAACGAACC 600 

ACCTGGTGCT AGAGCTAGAG TTGCTTTGAC TGGTTTGACC ATTGCTGAAT 650

ACTTCAGAGA TGAAGAAGGT CAAGATGTCT TGTTGTTCAT TGACAACATT 700 

TTCAGATTCA CCCAAGCTGG TTCCGAAGTG TCTGCTTTGT TGGGTCGTAT 750 

TCCATCTGCT GTCGGTTATC AACCAACCTT GGCCACTGAT ATGGGTCTTT 800 

TGCAAGAACG TATTACCACC ACCAAGAAAG GTTCTGTTAC TTCTGTCCAA 850 

GCTGTTTATG TCCCAGCCGA TGATTTAACC GATCCAGCTC CAGCTACCAC 900

CTTCGCTCAC TTGGATGCCA CTACTGTCTT GTCTAGAGGT ATTTCTGAAT 950 

TGGGTATTTA CCCAGCTGTC GATCCATTGG ATTCTAAATC CAGATTATTG 1000 

GATGCTACTG TTGTTGGTCA AGAACATTAT GATGTTGCCA CTGGTGTTCA 1050 

ACAAACTTTA CAAGCTTACA AGTCCTTGCA AGATATCATT GCTATTTTGG 1100 

GTATGGATGA ATTGTCCGAA GCCGATAAAT TGACTGTCGA AAGAGCTCGT 1150

AAGATTCAAA GATTCTTGTC CCAACCATTC GCTGTTGCCG AAGTTTTCAC 1200 

TGGTATCCCA GGTAGATTGG TTAGATTACA AGATACTGTT AAATCTTTCA 1250 

AAGATGTCTT GGAAGGTAAA TACGATCACT TGCCAGAAAA 1290 



2) INFORMATION FOR SEQ ID NO: 476

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1267 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utilis 

(B) STRAIN: Csp 388 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476 



TTTGAACGCC CTTGAGATCA AGAACCACGA GTCTGGAAGA TTGGTCCTCG 50 

AGGTTGCCCA GCACTTGGGT GAGAACACTG TCAGAACTAT TGCCATGGAT 100 

GGTACCGAAG GTCTCGTCCG TGGTGAGTCT GTTGTTGACA CCGGTTCTCC 150

AATTACTGTC CCAGTTGGTC GTGAGACCTT GGGTCGTATC ATCAACGTTA 200 

TTGGTGAGCC AATTGATGAG CGTGGTCCAA TCAACACCAA GCACAGAAAC 250 

CCAATTCACG CTGAGCCACC ATCCTTCGCT GAGCAGTCTA CTGCTGCTGA 300 

GGTTTTGGAG ACTGGTATCA AGGTTGTCGA CCTTCTCGCC CCATACGCCA 350 

GAGGTGGTAA GATTGGTCTC TTTGGTGGTG CCGGTGTCGG TAAGACCGTT 400

TTCATCCAGG AGTTGATTAA CAACATTGCC AAGGCCCACG GTGGTTTCTC 450 

TGTTTTCACC GGTGTCGGTG AGAGAACCAG AGAGGGTAAC GATTTGTACC 500 

GTGAGATGAA GGAGACTGGT GTCATCAATC TCGAGGGTGA GTCCAAGGTT 550 

GCTCTCGTCT TCGGTCAGAT GAACGAGCCA CCAGGTGCCC GTGCCCGTGT 600 

CGCTTTGACC GGTTTGACCA TTGCTGAGTA CTTCAGAGAT GAGGAGGGTC 650

AGGATGTCTT GTTGTTTATT GACAACATTT TCAGATTCAC CCAGGCCGGT 700

TCCGAGGTGT CTGCCTTGCT TGGTCGTATT CCATCCGCTG TCGGTTACCA 750 

GCCAACTTTG GCCACCGATA TGGGTTTGTT GCAGGAGAGA ATTACCACCA 800 

CCCAGAAGGG TTCCGTCACT TCTGTCCAGG CCGTTTACGT CCCAGCTGAT 850 

GATTTGACTG ATCCTGCTCC AGCCACCACT TTCGCCCACT TGGACGCCAC 900

CACTGTGTTG TCCCGTGGTA TCTCTGAGTT GGGTATTTAC CCAGCTGTCG 950 

ACCCATTGGA CTCCAAGTCC AGATTGTTGG ACGCTGCCGT TGTTGGTGAC 1000 

GAGCACTACA ACACCGCCAC CGATGTCCAG CAGACCCTTC AGGCTTACAA 1050 

GTCTCTCCAG GATATCATTG CTATTTTGGG TATGGATGAG TTGTCTGAGG 1100 

CTGACAAGTT GACTGTCGAG AGAGCCAGAA AGATTCAGCG TTTCCTTTCC 1150

CAGCCATTCG CTGTCGCTGA GGTTTTCACC GGTATCCCAG GTAGATTGGT 1200 

TAGACTTCAG GACACCATCA AGTCCTTCAG AGAGGTTTTG GACGGTAAGT 1250

ACGACCACTT GCCAGAG 1267



2) INFORMATION FOR SEQ ID NO: 477

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1296 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 28269 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477 



CCAATTCGAC GAAGGTAGCT TGCCAGCTAT CTTGAACGCC TTGACCTTGA 50 

AGAACGGTGA CCAAGACTTG GTCTTGGAAG TTGCCCAACA CTTGGGTGAA 100 

AACACTGTCA GAGCTATTGC TATGGATGGT ACCGAAGGTT TGGTCAGAGG 150

TACTGCTGTC ACTGACACCG GTGCTCCAAT CTCCGTCCCA GTCGGTAGAG 200

GTACCTTGGG TAGAATCATC AACGTTGTTG GTGAACCAAT TGACGACAGA 250 

GGTCCAATTG AATGTAAGGA AAGAAAGCCA ATTCACGCTG AACCACCTTC 300

TTTCGTTGAA CAATCCACTG CTGCCGAAAT TTTGGAAACC GGTATCAAGG 350

TTGTCGACTT GTTGGCCCCA TACGCCAGAG GTGGTAAGAT TGGTTTGTTC 400

GGTGGTGCCG GTGTCGGTAA GACTGTCTTT ATCCAAGAGT TGATTAACAA 450

CATTGCCAAG GCCCATGGTG GTTTCTCTGT CTTCACTGGT GTCGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACCGGTGTC 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCC TTGGTTTTCG GTCAAATGAA 600 

CGAACCACCT GGTGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650

CCGAATACTT CAGAGACGAA GAAGGTCAAG ATGTCTTGTT GTTTATTGAC 700 

AACATTTTCA GATTCACCCA AGCCGGTTCC GAAGTGTCTG CTTTGTTGGG 750 

TCGTATTCCA TCTGCCGTCG GTTATCAACC AACCTTGGCC ACCGATATGG 800 

GTCTTTTGCA AGAACGTATT ACCACCACCA AGAAGGGTTC CGTCACCTCT 850 

GTCCAAGCTG TCTATGTCCC AGCCGATGAT TTGACCGATC CTGCTCCAGC 900

CACCACCTTC GCTCACTTGG ATGCCACTAC TGTCTTGTCT AGAGGTATTT 950 

CCGAATTGGG TATCTACCCA GCTGTCGATC CATTGGACTC CAAGTCCAGA 1000 

TTGTTGGATG CTGCTGTTGT TGGTCAAGAA CATTACGATG TTGCCACTGG 1050 

TGTCCAACAA ACCTTGCAAG CTTACAAGTC TTTGCAAGAT ATCATTGCCA 1100 

TTTTGGGTAT GGATGAATTG TCTGAATCCG ATAAGTTGAC TGTCGAAAGA 1150

GCTCGTAAGA TCCAAAGATT CTTGTCGCAA CCATTTGCTG TTGCCGAAGT 1200 

TTTCACTGGT ATCCCAGGTA GATTAGTTAG ATTGCTGGAA ACCGTCCAAT 1250 

CTTTCAAAGA CGTCTTGGCT GGTAAATACG ATCACTTGCC AGAAAA 1296



2) INFORMATION FOR SEQ ID NO: 478

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1295 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 

(B) STRAIN: ATCC 7351 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478 

TCAATTCGAG CAAGGCAACC TCCCTGCCAT CTTGAACGCT CTCACCTTGA 50 

AGAATGGTGA CAACGACTTG GTTTTGGAAG TTGCCCAGCA CTTGGGTGAG 100 

AACACCGTCA GAGCCATTGC CATGGATGGT ACCGAGGGTT TGGTTAGAGG 150 

TGCGTCCGTC AAGGACACTG GCGCCCCTAT CTCGGTCCCC GTTGGCCGCG 200 

GGACTTTGGG TCGTATCATC AACGTCACCG GTGACCCCAT TGACGAGAGA 250

GGTCCCATCG AGCTGACCCA GAGAAACCCT ATCCACGCCG ACCCCCCCTC 300

GTTCGTTGAG CAGTCCACCA ACGCTGAGGT TTTGGAGACT GGTATCAAGG 350

TTGTCGATTT GTTGGCTCCC TACGCCAGAG GTGGTAAGAT TGGTTTGTTC 400

GGTGGTGCCG GTGTCGGTAA GACCGTCTTC ATTCAGGAGT TGATCAACAA 450

CATCGCCAAG GCCCACGGTG GGTTCTCGGT CTTCACTGGT GTCGGTGAGA 500 

GAACTAGAGA GGGTAACGAC TTGTACCGTG AGATGAAGGA GACCGGTGTC 550 

ATCAACTTGG AGGGTGACTC CAAGGTGGCC TTGGTGTTCG GTCAGATGAA 600 

CGAGCCCCCT GGAGCCAGAG CCAGAGTCGC CTTGACCGGG TTGACCATTG 650 

CCGAATACTT TAGAGACGAG GAGGGTCAGG ATGTGTTGTT GTTCGTCGAC 700 

AACATCTTCA GATTCACCCA AGCTGGTTCG GAGGTGTCGG CCTTGTTGGG 750 

TCGTATTCCC TCTGCCGTCG GTTACCAGCC CACCTTGGCA ACTGATATGG 800 

GATTGTTGCA GGAGCGTATC ACCACGACCA AGAAGGGTTC CGTCACCTCA 850 

GTGCAGGCCG TCTACGTCCC CGCTGATGAC TTGACTGACC CTGCTCCCGC 900 

CACCACCTTT GCCCACTTGG ACGCCACCAC CGTGTTGTCC AGAGGTATCT 950 

CTGAGTTGGG TATCTACCCC GCCGTCGACC CCTTGGACTC CAAATCGAGA 1000 

TTGTTGGACG CTGCCGTGGT CGGTCAGGAG CACTACGATG TTGCCTCGAA 1050 

CGTCCAGCAG ACCTTGCAGG CCTACAAGTC CTTGCAGGAT ATCATTGCCA 1100 

TTTTGGGTAT GGATGAGTTG TCCGAGGCTG ACAAGTTGAC CGTTGAGAGA 1150 

GCCAGAAAGA TCCAGAGATT CTTGTCGCAG CCCTTCGCTG TTGCCGAGGT 1200 

TTTCACTGGT ATCAAGGGTA GATTGGTCAG ATTGGAGGAC ACCGTCAGAT 1250

CTTTCAAGGA GGTTTTGGAG GGTAAGTACG ACCACTTGCC CGAGA 1295



2) INFORMATION FOR SEQ ID NO: 479 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) STRAIN: Silveira

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479



TTCAGGAATT GATTGTAAGT TCTGTTATCA ACTAAAGCCG ACAGCGGTTG 50

CTGATATGCT CTAGAACAAC ATTGCTAAGG CTCACGGTGG TTACTCCGTG 100

TTCACTGGTG TCGGTGAGCG TACCCGTGAG GGTAACGATT TGTACCATGA 150

AATGCAAGAG ACCCGTGTCA TTCAACTCGA CGGAGAGTCC AAGGTCGCTC 200

TTGTCTTCGG TCAAATGAAC GAGCCCCCTG GTGCCCGTGC CCGTGTTGCC 250

CTTACCGGTT TGACCATTGC TGAATACTTC CGTGACGAGG AAGGCCAAGA 300

CGGTAGGCTT CATGCTTCTA TCGCTAGGGG CGTGTGATAC AGGAGGCTAA 350

TCGCTTTTCT AGTGCTTCTC TTTATTGACA ACATTTTCCG TTTCACTCAA 400

GCTGGTTCTG AAGTGTCTGC CTTGCTCGGT CGTATTCCTT CCGCTGTCGG 450

TTACCAACCT ACTCTCGCCG TCGACATGGG TGTTATGCAG GAACGTATCA 500

CCACCACCAC CAAGGGATCC ATTACTTCAG TGCA 534



2) INFORMATION FOR SEQ ID NO: 480 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 494 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus albidus 

(B) STRAIN: ATCC 66030 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480 



CGTCTTGATT CAAGAATTGA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50

ACTCGGTCTT CACCGGTGTC GGTGAGCGAA CTCGAGAGGG TAACGATCTG 100

TACCACGAAG TGAGTTGCGC CGTCCGAGTG TTTCCCGGGG AATCGCAAGA 150

CTGATGTTGT CCCTTCTTCT CAGATGAGGG AAACCGGTGT CATCAACCTC 200

GAGGGTGACT CCAAGGTCGC CTTGGTCTTC GGTCAGATGA ACGAGCCCCC 250

TGGAGCCCGA GCCCGAGTCG CCTTGACCGG TTTGACCATT GCCGAATACT 300

TCCGAGACGA GGAGGGTCAG GATGTCTTGT TGTTCATTGA CAACATTTTC 350

CGATTCACCC AAGCCGGTTC CGAAGTGTCC GCCTTGTTGG GTCGTATCCC 400

CTCCGCCGTC GGTTACCAGC CCACTCTGTC CACCGACATG GGTACCATGC 450

AGGAGCGAAT TACCACCACC AAGAAGGGTT CCATCACTTC CGTC 494



2) INFORMATION FOR SEQ ID NO: 481 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: WSA-212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481

TTCAGGAGCT TATCAACAAC ATCGCCAAGG CCCACGGTGG TTACTCCGTC 50

TTCACTGGTG TCGGTGAGCG AACTCGTGAG GGTAACGATC TGTACCACGA 100

AATGCAGGAG ACTTCCGTCA TTCAGCTTGA TGGCGAGTCC AAGGTCGCCC 150

TGGTCTTCGG TCAGATGAAC GAGCCCCCTG GAGCTCGTGC CCGTGTCGCC 200

CTTACCGGTC TTACTGTAGC TGAATACTTC AGAGATGAGG AGGGTCAGGA 250

CGTGCTGCTC TTCATTGACA ACATTTTCCG ATTCACTCAG GCCGGTTCCG 300

AGGTGTCTGC CCTTCTCGGT CGTATCCCCT CTGCCGTCGG TTACCAGCCC 350

ACCCTCGCCG TCGACATGGG TGGTATGCAA GAGCGTATTA CCACCACCAC 400

CAAGGGTTCC ATTAC 415



2) INFORMATION FOR SEQ ID NO: 482

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1281 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Geotrichum spp.

(B) STRAIN: Lev-4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482 

AGGACAACCT CCCCGCTATT CTTAACGCTC TTGAGCTTAA GAGAGATAAC 50 

GGTGAGAAGC TCGTTCTCGA GGTTGCCCAG CATCTGGGTG AGAACACTGT 100 

CCGTACTATT GCTATGGACG GTACTGAGGG TCTCGTCCGT GGTCAGCCCG 150 

TTGTTGACAC CGGTGCCCCC ATTACCATTC CCGTTGGTCG TGGTACTCTT 200 

GGTAGAATTA TCAACGTCAT TGGTGAGCCC ATCGATGAGC GTGGACCCAT 250 

TGAGGCTACC AAGTACCTCC CCATCCACAC CGAGCCCCCC ACCTTCGCTG 300 

AGCAGTCTAC CTCCGCTGAG GTTCTTGAGA CTGGTATCAA GGTTGTCGAT 350 

CTCCTTGCCC CCTACGCCCG TGGTGGTAAG ATTGGTCTCT TCGGTGGTGC 400

CGGTGTCGGT AAGACCGTTT TCATTCAGGA GCTGATTAAC AACATTGCCA 450 

AGGCCCATGG TGGTTTCTCC GTTTTCACCG GTGTCGGTGA GAGAACCCGT 500 

GAGGGTAACG ATTTGTACCG TGAGATGAAG GAGACCGGTG TCATCAACCT 550 

CGAGGGTGAG TCTAAGGTCG CTCTCGTTTT CGGTCAGATG AACGAGCCCC 600 

CTGGAGCCCG TGCCCGTGTT GCTCTTACTG GTCTTACCAT TGCTGAGTAC 650 

TTCCGTGATG AGGAGGGTCA GGATGTGTTG CTCTTCGTTG ACAACATTTT 700 

CCGTTTCACT CAGGCCGGTT CCGAGGTGTC TGCCCTTTTG GGTCGTATTC 750 

CCTCCGCTGT CGGTTACCAG CCCACCCTTG CCACTGATAT GGGTGCCCTG 800 

CAAGAGCGTA TTACCACCAC CCAGAAGGGT TCCGTCACTT CCGTCCAGGC 850 

CGTCTACGTC CCTGCCGATG ATTTGACCGA TCCTGCCCCT GCTACCACCT 900 

TCGCCCATTT GGATGCCACC ACCGTCTTGT CGCGTTCCAT TTCTGAGTTG 950 

GGTATCTACC CCGCTGTCGA TCCCCTTGAT TCCAAGTCTC GTCTTTTGGA 1000 

TATCACCGTC GTTGGCCAGG AGCACTACGA TGTTGCTACC CAGGTCCAGC 1050 

AGACCCTCCA GTCCTACAAG TCTCTTCAGG ATATCATTGC CATTTTGGGT 1100 

ATGGATGAGT TGTCTGAGGC TGATAAGCTT ACTGTCGAGC GTGCCCGTAA 1150 

GATCCAGAGA TTCCTTTCCC AGCCCTTCAC TGTCGCTGAG GTTTTCACTG 1200 

GTATCGAGGG CCGTCTCGTT CCTTTGAAGG ACACTGTTCG CTCTTTCAAG 1250 

GAGATCCTTG AGGGCAAGTA CGACCACCTC C 1281 



2) INFORMATION FOR SEQ ID NO: 483 






WO 01/23604 PCT/CA00/01150 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 586 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Histoplasma capsulatum

(B) STRAIN: G185A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483 

ATTCAAGAAT TGATCGTACG TTCCTCCGCC CCACACACGA TCAATGGAGA 50

AAGAAACAAA TTTTTTGGGA GTGGTCATTT TTTCTAATAA TTCGAATAGA 100 

ACAACATTGC CAAAGCCCAC GGTGGTTACT CCGTGTTCAC TGGTGTCGGC 150 

GAGCGGACCC GTGAAGGAAA TGACTTGTAC CACGAAATGC AGGAAACCCG 200 

TGTTATCCAG CTCGATGGAG AGTCCAAGGT CGCCCTCGTT TTCGGTCAGA 250 

TGAACGAGCC TCCCGGAGCC CGTGCCCGTG TTGCCCTCAC TGGTCTGACC 300

GTTGCCGAGT ACTTCCGTGA CGAGGAAGGC CAAGACGGTA TGTATAAGTA 350 

TACACCGTAG CAAATCAACA CAGAGCTTCA CTCACGCTCG GATTTAGTGC 400 

TTCTCTTCAT CGACAACATT TTCCGCTTCA CTCAGGCCGG TTCCGAAGTG 450 

TCTGCCCTGC TCGGCCGTAT TCCCTCCGCC GTCGGTTACC AACCCACCCT 500 

CGCCGTGGAC ATGGGTGGTA TGCAGGAACG TATCACCACT ACCACCAAGG 550

GCTCCATCAC CTCTGTGCAR GCCGTCTACG TCCCCG 586 



2) INFORMATION FOR SEQ ID NO: 484

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1145 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Malassezia furfur

(B) STRAIN: ATCC 42132

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484



GCGTGGCCAG AAGGTCATTG ACACTGGTGC TCCCATCACC ATCCCCGTCG 50 

GTGGTGCCAC ACTGGGTCGT ATTCTGAACG TCACGGGTGA CCCTATTGAC 100 

GAGCGTGGCC CCGTTAAGAC TGACGTTTTC CGCCCCATTC ACCGTGACCC 150 

CCCTGCCTTT GTCGAGCAGT CGACTGATGC CGAGATTCTC GAGACTGGTA 200 

TCAAAGTCGT TGACCTGATT GCCCCTTACG CCCGTGGTGG TAAGATTGGT 250

CTGTTCGGTG GTGCCGGTGT CGGTAAGACC GTGCTTATCC AGGAGCTCAT 300 

CAACAACATC GCCAAGGCCC ACGGTGGTTT CTCCGTGTTC ACTGGTGTCG 350 

GTGAGCGTAC TCGTGAGGGT AACGATTTGT ACCACGAGAT GATTGAAACC 400 

GGTGTCATTA ACCTCGAGGG TGACTCGAAG GTGGCTCTGG TGTTCGGTCA 450 

GATGAACGAG CCCCCGGGTG CCCGTGCGCG TGTCGCTCTT ACTGGTCTGA 500

CTGTGGCCGA GTACTTCCGT GACGACGAGG GCCAGGATGT GCTGCTGTTC 550 

ATTGACAACA TTTTCCGTTT CACCCAGGCC GGTTCGGAGA CTTCGGCTCT 600 

GCTGGGTCGT ATCCCTTCGT CGGTCGGTTA CCAGCCCACT TTGTCGACCG 650 

ATATGGGTGC CATGCAGGAG CGTATCACCA CCACCAAGAA GGGTTCGATT 700 

ACGTCGGTGC AGGCCGTCTA CGTGCCCGCC GATGATGTCA CTGACCCTGC 750

CCCTGCCACT ACCTTCGCCC ACCTTGACGC TACCACTGTG CTTGACCGTT 800

CGATCGCTGA GCTGGGTATC TACCCCGCTG TTGACCCGTT GAACTCGAAC 850

TCGCGTATGC TTGACCCCGC TATTGTGGGT CAGGAGCACT ACGACGTGGC 900

CTCTGGCGTG CAGAAGCTGC TCCAGGACTA CAAGTCGCTT CAAGATATCA 950

TTGCCATTCT GGGTATGGAT GAGCTTTCTG AGGAGGACAA GCTCACTGTC 1000

GAGCGTGCCC GTAAGATGCA GCGTTTCATG TCGCAGCCTT TCGCTGTCGC 1050

CCAGGTCTTT ACTGGTATCG AAGGTCGTCT TGTTGCCCTG AAGGACACGA 1100

TCAAGGCCTG CAAGGAGATC CTGTCGGGCA AGCACGACAA CCTCC 1145






2) INFORMATION FOR SEQ ID NO: 485 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1261 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia pachydermatis 

(B) STRAIN: ATCC 42756 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485 



TCCCGCCATC TTCAACGCCC TGGAGGTCCA GGACATGAAG AACGGTGGCC 50

GCCTTGTTCT GGAGGTTGCC CAGCACCTTG GTGAGAACAC TGTTCGTTGC 100

ATTGCTATGG ACGGTACCGA GGGTCTTGTC CGTGGTCAGA AGGTCCTTGA 150

CACTGGTGCC CCGATCACTA TCCCTGTCGG TAACGGTACC TTGGGCCGYA 200

TCCTGAACGT CACTGGTGAG CCTGTGGATG AGCGTGGTCC GGTTAAGACT 250

GACGTCTACC GTCCAATCCA CCGTGAGCCC CCGGCGTTCG TTGAGCAGTC 300

GACTGATGCT GAGATTCTTG AGACTGGTAT CAAGGTGGTC GACCTGCTCG 350

CCCCATACGC TCGTGGTGGT AAGATTGGTC TWTTCGGTGG TGCYGGTGTY 400

GGTAAGACCG TGCTGATTCA GGAGCTTATC AACAACATTG CCAAGGCCCA 450

CGGTGGTTTC TCGGTGTTCA CTGGTGTCGG TGAGCGTACT CGTGAGGGTA 500

ACGATCTGTA CCATGAAATG ATTGAGACTG GTGTCATCAA CGTTGACGGT 550

GACTCGAAGG TCGCTCTCGT GTTCGGTCAG ATGAACGAGC CCCCGGGTGC 600

CCGTGCCCGT GTCGCCCTGA CYGGTCTGAC CATCGCCGAG TACTTCCGTG 650

ACGACGAGGG TCAGGATGTG CTGCTCTTCA TTGACAACAT TTTCCGTTTC 700

ACTCAGGCTG GTTCGGAGAC TTCGGCTCTG CTGGGTCGTA TCCCGTCGGC 750

TGTCGGTTAC CAGCCKACCC TTGCCACGGA TATGGGTGCC ATGCAGGARC 800

GTATYACCAC CACCAAGAAG GGTTCGATTA CCTCGGTGCA GGCYGTTTAC 850

GTTCCGGCCG ACGATGTGAC TGACCCTGCC CCGGCCACGA CCTTCGCCCA 900

CTTGGACGCC ACCACGGTGT TGGACCGTTC GATTGCYGAG CTGGGTATCT 950

ACCCGGCCGT CGACCCGCTG AACTCGAAGT CGCGTATGCT TGACCCGTCA 1000

ATTGTCGGTG TGGAGCACTA CAACGTTGCT TCGGGTGTCC AGAAGCTTCT 1050

YCAGGACTAC AAGTCGCTCC AAGATATCAT TGCCATTCTG GGTATGGATG 1100

AGTTGTCGGA GGAGGACAAG CTCACTGTCG AGCGTGCCCG TAAGATGCAG 1150

CGTTTCCTGT CGCAGCCTTT CGCTGTGGCC CAGGTCTTCA CTGGTATCGA 1200

GGGTCGTCTT GTGTCGCTCA AGGACACCAT CAACGCCTGC AAGGAGATTC 1250

TGTCCGGTAA G 1261






2) INFORMATION FOR SEQ ID NO: 486 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1282 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Metschnikowia pulcherrima 

(B) STRAIN: DSM 70336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486 

AGGAGGGCAA CTTGCCAGCT ATCTTCAACG CTTTGACGTT GAAGAACGGC 50 

GACCAGAAGT TGGTCTTGGA GGTGGCCCAG CACTTGGGTG AGAACACCGT 100 

CAGAACCATT GCCATGGACG GTACCGAGGG TTTGGTCAGA GGCGCCTCTG 150

TCACCGACAC YGGTGCCCCT ATCTCCGTGC CTGTCGGCCG TGAGACCTTG 200

GGTCGTATTA TCAATGTTGT TGGTGAGCCA ATCGACGAGA GAGGCCCAAT 250 

CAACACCAAG AAGAGAAACC CTATTCACAC CGACCCACCT TCGTTTGTCC 300 

AGCAATCCAC TTCCGCCGAG GTCTTGGAGA CTGGTATCAA GGTTGTCGAC 350 

TTGTTGGCCC CTTACGCCAG AGGTGGTAAG ATTGGTTTGT TCGGTGGTGC 400 

CGGTGTCGGT AAGACCGTGT TCATCCAGGA GTTGATTAAC AACATTGCCA 450

AGGCCCACGG TGGTTTCTCC GTTTTCACCG GTGTCGGTGA GAGAACCAGA 500 

GAGGGTAACG ATTTGTACCG TGAAATGCAG GAGACTGGTG TCATCAACTT 550 

CGAGGGTGAC TCCAAGGTCG CCTTGGTTTT CGGTCAGATG AACGAGCCTC 600 

CAGGAGCTAG AGCYAGAGTT GCCTTGACCG GTTTGACCAT CGCCGAGTAC 650 

TTCAGAGACG AGGAGGGTCA GGATGTGTTG TTGTTCGTCG ACAACATTTT 700 

CAGATTCACC CAAGCCGGTT CTGAGGTGTC TGCCTTGTTG GGTCGTATTC 750 

CATCCGCTGT CGGATACCAG CCAACCTTGG CCACCGATAT GGGTGCCTTG 800 

CAGGAGAGAA TTACCACCAC CAAGAAGGGT TCCGTCACCT CCGTCCAGGC 850 

CGTCTACGTG CCAGCCGATG ACTTGACTGA CCCTGCGCCA GCCACCACTT 900 

TCGCCCACTT GGACGCCACC ACTGTGTTGT CCAGAGGTAT CTCTGAGTTG 950 

GGTATCTACC CCGCTGTCGA CCCCTTGGAC TCCAACTCCA GATTGTTGGA 1000 

CGCCACCGTT GTTGGCCAGG AGCACTACGA CGTCGCCACC AACGTCCAGC 1050 

AAACTTTACA AGCTTACAAG TCCTTGCAGG ATATCATTGC CATTTTGGGT 1100 

ATGGATGAGT TGTCCGAGAC CCGACAAGTT GACCCGTCGA GAGAGCCAGA 1150 

AAGATCCAGA AGTTCTTGTC CCAGCCATTT GCCGTCGCCG AGGTTTTCAC 1200 

CGGTATTGAG GGTAGATTGG TTAGATTGGA GGACACCGTT AGATCCTTTA 1250 

AGGAGGTTTT GGAAGGTAAG TACGACCACT TG 1282 



2) INFORMATION FOR SEQ ID NO: 487 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 482 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: WSA-214 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487 

TGTCTTTATC CAGGAGTTGA TTGTACGTCT TTACCTTTCT GCCTGACTGT 50 

TTACGACAAC TAACGAAAGC GTAGAACAAC ATTGCCAAGG CTCACGGTGG 100 

TTACTCTGTC TTCACTGGTG TCGGTGAACG TACTCGTGAG GGTAACGATT 150

TGTACCACGA AATGCAGGAA ACTGGTGTCA TTCAGCTCGA GGGTGAATCC 200

AAGGTCGCCC TCGTGTTCGG TCAGATGAAC GAGCCCCCCG GTGCCCGTGC 250

CCGTGTCGCT CTTACTGGTT TGACCATTGC CGAGTACTTC CGTGACGAGG 300

AAGGTCAGGA CGTGCTTCTC TTCATTGACA ACATTTTCCG TTTCACTCAG 350

GCCGGTTCTG AGGTGTCTGC CCTTCTGGGT CGTATCCCCT CTGCCGTCGG 400

TTACCAGCCC ACCCTTGCCG TCGACATGGG TATCATGCAG GAGCGTATTA 450

CCACCACCAC CAAGGGTTCC ATCACCTCCG TC 482

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2) INFORMATION FOR SEQ ID NO: 488 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1290 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 18205 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488 

TCGAACAAGG TAACTTACCA GCTATCTTGA ATGCTTTAGA AATCAAAACC 50 

CCAAGTGGTA CTCCATTAGT TTTAGAAGTT GCTCAACATT TAGGTGAAAA 100 

CACTGTCAGA ACTATTGCTA TGGATGGTAC TGAAGGTTTA GTTCGTGGTG 150 

AACAAGTTAC TGATACTGGT TCTCCAATCA CTGTCCCAGT TGGTCGTGAA 200 

ACTTTAGGTC GTATTATCAA CGTTGTTGGT GAACCAATTG ATGAACGTGG 250 

TCCAATTAAC ACCAAACAAA GAAACCCAAT TCACGCTGAA CCACCTTCAT 300 

TCAGTGAACA ATCAACTGCT GCTGAAGTTT TAGAAACTGG TATCAAAGTT 350 

GTYGATTTAT TAGCTCCATA CGCTAGAGGT GGTAAAATTG GTTTATTCGG 400

TGGTGCCGGT GTCGGTAAAA CTGTCTTTAT CCAAGAATTG ATTAACAACA 450 

TTGCTAAAGC TCATGGTGGT TTCTCAGTTT TCACCGGTGT TGGTGAAAGA 500 

ACCAGAGAAG GTAACGATTT ATACCGTGAA ATGAAAGAAA CTGGTGTTAT 550 

TAACTTGGAA GGTGATTCTA AGGTCGCTTT AGTTTTCGGT CAAATGAATG 600 

AACCACCAGG TGCTAGAGCT CGTGTTGCTT TAACTGGTTT GACCATTGCT 650 

GAATACTTCA GAGATGAAGA AGGTCAAGAT GTCTTGTTAT TCGTTGATAA 700 

CATTTTCAGA TTCACCCAAG CCGGTTCAGA AGTTTCTGCC TTATTAGGTC 750 

GTATTCCATC TGCTGTCGGT TATCAACCAA CTTTAGCAAC TGATATGGGT 800 

TTGTTACAAG AACGTATTAC CACCACACAA AAAGGTTCAG TTACTTCTGT 850 

CCAAGCTGTT TATGTCCCAG CTGATGATTT AACAGATCCT GCTCCAGCTA 900 

CCACTTTCGC CCATTTGGAT GCTACTACTG TCTTGTCTCG TGGTATTTCA 950 

GAATTAGGTA TTTACCCAGC TGTCGATCCA TTAGATTCTA AATCAAGATT 1000 

ATTAGATGCT TCAGTTGTTG GTCAAGAACA TTATGATGTT GCTACCAACG 1050 

TTCAACAAAC TTTACAAGCT TACAAATCTT TACAAGATAT TATTGCTATT 1100 

TTAGGTATGG ATGAATTGTC TGAACAAGAT AAATTGACTG TCGAAAGAGC 1150 

AAGAAAAATC CAAAGATTCT TATCTCAACC ATTTGCTGTT GCCGAAGTTT 1200 

TCACTGGTAT YCCAGGTAGA TTGGTTAGAT TAAAAGACAC TATCAAATCA 1250 

TTCAAAGATG TTTTGGAAGG TAAATATGAT CACTTACCAG 1290 



2) INFORMATION FOR SEQ ID NO: 489 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1291 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 2149

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489 

CCAATTCGAA CAAGGTAACT TACCAGCTAT CTTGAATGCT TTAGAAATCA 50 

AAACCCCAAG TGGTACTCCA TTAGTTTTAG AAGTTGCTCA ACATTTAGGT 100 

GAAAACACTG TCAGAACTAT TGCTATGGAT GGTACTGAAG GTTTAGTCCG 150 

TGGTGAACAA GTTACTGATA CTGGTTCTCC AATCACTGTC CCAGTTGGTC 200

GTGAAACTTT AGGTCGTATT ATCAACGTTG TTGGTGAACC AATTGATGAA 250 

CGTGGTCCAA TTAACACCAA ACAAAGAAAC CCAATTCACG CTGAACCACC 300

TTCATTCAGT GAACAATCAA CTGCTGCTGA AGTTTTAGAA ACTGGAATCA 350 

AAGTTGTTGA TTTATTAGCT CCATACGCTA GAGGTGGTAA AATTGGTTTA 400 

TTCGGTGGTG CCGGTGTCGG TAAAACTGTC TTTATCCAAG AATTGATTAA 450 

CAACATTGCT AAAGCTCATG GTGGTTTCTC AGTTTTCACC GGTGTTGGTG 500 

AAAGAACCAG AGAAGGTAAC GATTTATACC GTGAAATGAA AGAAACTGGT 550 

GTTATTAACT TGGAAGGTGA TTCTAAGGTC GCTTTAGTTT TCGGTCAAAT 600 

GAATGAACCA CCAGGTGCTA GAGCTCGTGT TGCTTTAACT GGTTTGACCA 650 

TTGCTGAATA CTTCAGAGAT GAAGAAGGTC AAGATGTCTT GTTATTCGTT 700 

GATAACATTT TCAGATTCAC CCAAGCCGGT TCAGAAGTTT CTGCCTTATT 750 

AGGTCGTATT CCATCTGCTG TCGGTTATCA ACCAACTTTA GCAACTGATA 800 

TGGGTTTGTT ACAAGAACGT ATTACCACCA CACAAAAAGG TTCAGTTACT 850 

TCTGTCCAAG CTGTTTATGT CCCAGCTGAT GATTTAACAG ATCCTGCTCC 900 

AGCTACCACT TTCGCCCATT TGGATGCTAC TACTGTCTTG TCTCGTGGTA 950 

TTTCAGAATT AGGTATTTAC CCAGCTGTCG ATCCATTAGA TTCTAAATCA 1000 

AGATTATTAG ATGCTTCAGT TGTTGGTCAA GAACATTATG ATGTTGCTAC 1050 

CAACGTTCAA CAAACTTTAC AAGCTTACAA ATCTTTACAA GATATTATTG 1100 

CTATTTTAGG TATGGATGAA TTGTCTGAAC AAGATAAATT GACTGTCGAA 1150 

AGAGCAAGAA AAATCCAAAG ATTCTTATCT CAACCATTTG CTGTTGCCGA 1200 

AGTTTTCACT GGTATCCCAG GTAGATTGGT TAGATTAAAA GACACTATCA 1250 

AATCATTCAA AGATGTTTTG GAAGGTAAAT ATGATCACTT A 1291 



2) INFORMATION FOR SEQ ID NO: 490 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula minuta 

(B) STRAIN: ATCC 10658 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490

CGTATTGATT CGTGAGTGGC CGTTCCCTTA CAGCAAGCTT ATAAAGGAGC 50 

GAAAAAGATC TGACATTCGG CTTATGTGCT ATACAGAGGA ACTCATCAAC 100 

AACGTCGCCA AGGCTCATGG TGGTTACTCT GTCTTCACCG GTGTCGGAGA 150 

GCGAACACGT GAAGGTAACG ATCTCTACCA CGAAATGATT GAAACCGGTG 200 

TCATTCAGCT CAAGAACGAC AAGTCCAAGG CCGCTCTGGT CTTCGGACAG 250

ATGAACGAGC CCCCCGGAGC TCGTGCCCGT GTCGCTCTGA CCGGTCTCAC 300

CATCGCCGAG TACTTCCGTG ACGTCGAAGG ACAGGATGTG CTACTCTTCA 350

TCGACAACAT TTTCCGATTC ACCCAGGCCG GTTCAGAGGT ATCTGCCCTG 400

CTCGGACGTA TCCCATCTGC TGTCGGATAC CAGCCCACAC TCTCAACCGA 450

TATGGGTGGT ATGCAAGAGC GAATCACAAC CACCAAGAAG GGTTCGATTA 500

CCTCCGTC 508

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2) INFORMATION FOR SEQ ID NO: 491 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 686 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula mucilaginosa 

(B) STRAIN: ATCC 66034 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491 

TGTCCTCATT CAGGAGCTCA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

ACTCGGTCTT CACCGGTGTC GGCGAGCGTA CCCGTGAGGG TAACGGTGAG 100 

TCTCCCCCTT CAAACTTTTG GCCGGCTAGT TGGCGCAGCG CAAACTGACG 150 

CGCGCGCCCT GTCCAGACTT GTACCACGAG ATGATCGAGA CTGGTGTCAT 200 

CCAGCTCGAG AACGACAACT CGAAGTGCGC TCTCGTGTTC GGCCAGATGA 250 

ACGAGCCCCC TGGTGCCCGT GCCCGTGTCG CTCTCACTGG GTTCGTCCTT 300 

TCTCTCTCTC GAGCGTCCTG GCTTGATACG GAACGCTGAC ACGTCACGCA 350 

GTCTCACTAT TGCTGAGTAC TTCCGTGACG AGGAGGGCCA GGACGTGCTC 400 

CTCTTCATCG ACAACATCTT CCGTTTCACC CAGGGTGAGC CGCCTCCGCG 450 

GGCATTCTCC CGTTTCTTTC GCGCTGACGT CTGTCCCGTA TAGCCGGTTC 500 

GGAGGTGTCT GCCCTTCTCG GACGTATCCC GTCCGCTGTC GGATACCAGC 550 

CGACTCTCTC GACCGACATG GGTCAGATGC AGGAGCGTAT CACGTAAGTT 600 

TGGCCGCAGC TCCGTCCGCG GCGCCCTTTG TGTCTGACCG TGTTCCACCG 650 

CTCAGCACCA CCAAGAAGGG CTCGATCACC TGTGTC 686 



2) INFORMATION FOR SEQ ID NO: 492 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 625 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporobolomyces salmonicolor 

(B) STRAIN: ATCC 32311 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492 

TGTCTTGATT CAGGAGCTCA TCAACAACGT CGCCAAGGCG CACGGTGGTT 50

ACTCCGTTTT CACCGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGGTGAG 100

CACACTTTCG CCGACCTGGC CATTCCGGAA CGTGCGGACT GACGAGGACC 150

ACCTCGAACA GATCTCTACC ACGAGATGAT CGAGACCGGT GTCATTCAGC 200

TCGACAACGA CAAGTCGAAG ACTGCTCTCG TCTTCGGCCA GATGAACGAG 250

CCCCCTGGCG CCCGTGCCCG TGTCGCTCTT ACTGGTCTCA CCATCGCGGA 300

GTACTTCCGT GACGACGAGG GCCAGGACGT GCTTCTCTTC ATCGACAACA 350

TCTTCCGTTT CACCCAGGGT ACGTTCGATC CCGCCCGTCC AACACGAATG 400

TCGTGGTGAC TGACAACCTG TTGCGCGTGC AGCCGGTTCG GAGGTGTCTG 450

CCCTTCTCGG TCGTATCCCG TCCGCTGTCG GATACCAGCC CACTCTCTCG 500

ACCGACATGG GTGGCATGCA GGAGCGTATC ACGTACGCCC TCTTCTGCTT 550

TCTCTCGTTT CGCTCTGCAT CGCTCACGCA TGTTCGCCCG ACAGCACCAC 600

CAAGAAGGGT TCGATCACCT GTGTC 625



2) INFORMATION FOR SEQ ID NO: 493

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1211 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: WSA-148

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493 



TCTCGTCCGT GGTGCTAAGG CCACTGACAC TGGTTCCCCC ATTACCATCC 50

CCGTCGGCCC CGGTACCCTC GGTCGCATCA TGAACGTCAC CGGTGACCCG 100

ATCGACGAGC GCGGTCCCAT CAAGACCGAC AAGTTCCGTC CCATCCACGC 150

TGAGGCTCCC GAGTTCGTTG ACCAGTCGAC CACCGCTGAG GTTCTCGTGA 200

CTGGTATCAA GGTCGTCGAT CTGCTTGCTC CCTACGCCCG TGGTGGTAAG 250

ATTGGTCTGT TTGGCGGTGC CGGTGTTGGC AAGACCGTGT TCATCCAGGA 300

GCTCATCAAC AACATCGCCA AGGCCCACGG TGGTTACTCC GTCTTCACCG 350

GTGTCGGCGA GCGTACCCGT GAGGGTAACG ATCTGTACCA CGAAATGCAG 400

GAGACCTCTG TCATTCAGCT TGACGGTGAC TCCAAGGTCG CCCTGGTGTT 450

CGGTCAGATG AACGAGCCCC CTGGTGCTCG TGCCCGTGTC GCCTTGACCG 500

GTTTGACTGT CGCTGAGTAC TTCCGTGACG AGGAGGGCCA GGATGGTATG 550

TTTTGAATTA TTTCCTTGTC GTACAGTTCC AAATCGAAGA ATTACTAACT 600

TGTCAGTGCT TCTCTTCATC GACAACATTT TCCGCTTCAC CCAGGCCGGT 650

TCTGAGGTGT CTGCCCTTCT GGGTCGTATT CCCTCCGCTG TCGGTTACCA 700

GCCCACGCTC GCCGTGGACA TGGGTCTGAT GCAGGAGCGT ATTACCACCA 750

CCCGCAAGGG CTCAATTACC TCCGTCCAGG CCGTCTACGT GCCCGCTGAC 800

GATCTGACGG ATCCCGCCCC CGCCACCACC TTCGCCCATC TGGACGCCAC 850

CACTGTGCTG TCCCGAGGTA TCTCTGAGCT GGGTATCTAC CCCGCTGTCG 900

ACCCCCTCGA CTCCAAGTCG CGTATGCTGG ACCCCCGTAT TGTCGGTGAC 950

GACCACTACG AGACCGCCAC TCGCGTCCAG CAGATCCTCC AAGAGTACAA 1000

GTCGCTGCAG GACATCATCG CCATTCTGGG TATGGACGAG CTGTCTGAGG 1050

CCGACAAGCT TACAGTCGAG CGTGCTCGTA AGATCCAGCG TTTCCTGAGC 1100

CAGCCGTTCA CGGTCGCGCA GGTCTTCACT GGTATCGAAG GCCAGCTGGT 1150

CGATCTGAAG GACACTATCG CTTCGTTCAA GGCTATCCTG AGCGGTGAGG 1200

GTGACAGCCT T 1211

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2) INFORMATION FOR SEQ ID NO: 494 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1133 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Stephanoascus ciferrii

(B) STRAIN: ATCC 52550

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494



TCTTGTTAGA GGCACCCCAG TCAAGGACAC TGGTGCTCCA ATTACCATTC 50 

CAGTTGGTAA CGGCACTTTG GGCCGTATCG TCAACGTTCT CGGTGAGCCA 100

ATTGATGAGC GTGGACCAGT CAAGGCTGAC AAGTTCAGAC CTATTCACGC 150 

TGAGCCACCA ACCTTCGCTG ACCAGTCCAC CTCTGCCGAG GTTCTTGAGA 200

CCGGTATTAA GGTTGTCGAC TTGCTTGCCC CTTATGCCAG AGGTGGTAAG 250 

ATTGGTCTTT TCGGTGGTGC CGGTGTCGGT AAGACTGTGT TCATTCAGGA 300 

GCTTATTAAC AACATTGCTA AGGCCCACGG TGGTTACTCT GTCTTCACTG 350

GTGTCGGTGA GCGAACTCGT GAAGGTAACG ATTTGTACCA CGAAATGATG 400 

GAGACCGGTG TCATCAACCT TGAGGGTGAC TCCAAGGTGT CTCTTGTGTT 450

CGGTCAGATG AACGAGCCTC CAGGAGCCCG TGCCCGTGTT GCCTTGACCG 500 

GTTTGACCAT TGCCGAGTAC TTCAGAGATG AGGAGGGCCA GGATGTCTTG 550 

TTGTTCATTG ACAACATTTT CCGATTCACC CAGGCCGGTT CTGAGGTCTC 600 

TGCCTTGTTG GGTCGTATCC CATCTGCCGT CGGTTACCAA CCAACCTTGG 650 

CTACTGATAT GGGTGGTCTT CAAGAACGTA TTACCACCAC TCAAAAGGGT 700

TCCGTCACCT CTGTCCAGGC TGTCTACGTC CCAGCTGACG ATTTGACTGA 750 

TCCTGCCCCA GCTACCACCT TCGCCCATTT GGACGCCACC ACCGAATTGT 800 

CCCGATCTAT CTCTGAGTTG GGTATCTACC CAGCTGTCGA CCCTCTTGGT 850 

TCCAAGTCCC GTCTTTTGGA TGCCTCCGTC GTCGGCCAAG AGCACTACGA 900 

CGTTGCCGCC AACGTCCAAC AGACCTTGCA GGCCTACAAG TCTCTCCAGG 950

ATATCATTGC CATTTTGGGT ATGGACGAAT TGTCTGAGGC TGATAAGCTC 1000 

ACTGTCGAGC GTGCTCGTAA GATGCAGAGA TTCCTTTCTC AGCCATTCAC 1050 

CGTCGCTGAG GTCTTCACTG GTCTCGAGGG TAGACTCGTT TCTTTGAAGG 1100 

ACACCATCCG ATCCTTCAAG GAGATCCTTG ACG 1133



2) INFORMATION FOR SEQ ID NO: 495 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 608 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Trichophyton mentagrophytes

(B) STRAIN: WSA-225



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495 



GGAGTTGATT GTAAGTCATT TGAAACCCAG CCCCAAGAAA CAGAAGCTAG 50 

GCGAAAATTG GACAATTGAG CAATTTAGCC ATTGGAGAAA AGAAATTTCG 100

AGTATTAATT GTTTTTATAG AACAACATTG CCAAGGCTCA CGGTGGTTAC 150

TCTGTCTTCA CTGGTGTCGG AGAGCGTACC CGTGAAGGAA ACGATCTCTA 200

CCATGAAATG CAGGAGACCC GTGTCATTCA GCTTGATGGC GAGTCCAAGG 250

TCGCCCTGGT CTTCGGCCAG ATGAACGAGC CCCCAGGTGC CCGTGCCCGT 300

GTTGCTCTTA CTGGTTTGAC CATTGCTGAG TACTTCCGTG ATGAGGAAGG 350

TCAAGACGGT GAGTTTCTTA TGGATAAAAA AAAATTTTTT TTTTTTTTTT 400

TTTTTTTCAA GAAATTCATG TTCTAACAAA GTGTATTCTA GTGCTTCTCT 450

TCATCGACAA CATTTTCCGT TTCACTCAGG CTGGTTCCGA AGTGTCTGCC 500

CTGCTTGGTC GTATCCCATC TGCCGTCGGT TACCAACCCA CTCTTGCCGT 550

CGACATGGGT GGTATGCAGG AACGTATTAC CACCACCAAG AAGGGATCCA 600

TTACCTCC 608



2) INFORMATION FOR SEQ ID NO: 496

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 794 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Wangiella dermatitidis 

(B) STRAIN: WSA-229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496 

GTTTATTCAA GAACTCATTG TGGGTGGCAT TCTCATAATG TTTCGGCCAC 50

AATTACTGAT TGAAAATAGA ACAACATTGC AAAGGCTCAT GGTGGTTACT 100

CCGTGTTCAC TGGTGTCGGC GAGCGAACTC GTGAGGGTAA CGACTTGTAC 150

CACGAAATGC AGGAGACCTC TGTCATTCAG CTCGATGGCG AGTCCAAGGT 200

CGCGCTGGTG TTTGGTCAAA TGAACGAACC TCCTGGTGCT CGTGCTCGTG 250

TTGCTCTGAC TGGGTAAGTT GTTCCTTCGC TTCTTGCGCG TATCCACATC 300

CCCATCTTGA GAATACGTCT GCCACCATGT CATGTGATGT TGGGCTGGTT 350

CTGGTTTTTG GGAGGCCCTC AAGTTCAATT TTTGGATGAC AGCACCAGCT 400

TTACAAGATT ATGCTAACTT AATGGAGTCT TACGGTGGCT GAGTTCTTCA 450

GGGATGAGGA GGGACAGGAT GGTAAGTTTG ATAACAATCT CGTCGGTGTC 500

AATATCGACG GCGTACTCTT CGCATCAAAA AACCAAAGAG GTGGTTTGGT 550

GTGAGAAGTG CGCCGGAAAT AATGGCAACC ACGTGACAAT GACCACGTGT 600

GGGGCTCCCG TGCTAACACG TGACAGTCTT GCTCTTCATC GACAACATTT 650

TCCGATTCAC TCAASCCGGT TCTGARGTGT CTGCCTTGCT TGGTCGTATT 700

CCATCTGCCG TCGGTTACCA ACCCACACTC GCCGTCGACA TGGGTCTCAT 750

GCAGGAACGT ATCACCACCA CCCGGAAGGG ATCCATCACA TCTG 794



2) INFORMATION FOR SEQ ID NO: 497

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1148 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Yarrowia lipolytica

(B) STRAIN: ATCC 38295 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497 


TCTTGTCCGA GGCACCGCCG TCGCTGACAC CGGTGCTCCC ATCACTATCC 50 

CCGTCGGCCG AGGTACCCTT GGTCGAATCA TCAACGTCTG TGGTGAGCCC 100 

ATTGACGAGC GAGGACCCAT CGAGGCTTCC AAGTACCTCC CCATCCACGC 150 

TGACCCCCCT ACCTTCGCTG AGCAGTCTAC CTCCGCTGAG GTTCTCGAGA 200 

CCGGTATTAA GGTCGTCGAC CTCCTCGCCC CTTACGCCCG AGGTGGTAAG 250

ATTGGTCTCT TCGGTGGTGC CGGTGTCGGT AAGACTGTCT TCATCCAGGA 300 

GCTGATTAAC AACATTGCCA AGGCCCATGG TGGTTTCTCC GTTTTCTGCG 350 

GTGTCGGTGA GCGAACCCGA GAGGGTAACG ATCTTTACCG AGAGATGAAG 400

GAGACTGGTG TCATCAACCT CGAGGGTGAG TCTAAGGTCA CCCTCGTCTT 450 

CGGTCAGATG AACGAGCCTC CCGGAGCCCG TGCCCGAGTC GCCCTTACTG 500

GTCTGACCAT TGCCGAGTAC TTCCGAGACG AGGAGGGTCA GGATGTGTTG 550 

CTCTTCGTTG ACAACATTTT CCGATTCACC CAGGCCGGTT CCGAGGTGTC 600 

CGCTCTGCTT GGTCGAATTC CCTCCGCTGT CGGTTACCAG CCCACTCTGG 650 

CCACCGATAT GGGTGCCCTC CAGGAGCGAA TTACCACCAC CCAGAAGGGT 700 

TCCGTCACTT CCGTCCAGGC CGTCTACGTG CCTGCCGATG ATTTGACCGA 750

TCCTGCTCCC GCCACCACCT TCGCCCATCT TGACGCCACC ACCGTCCTGT 800 

CCCGAGGTAT TTCCGAGCTG GGTATCTACC CCGCTGTCGA TCCCCTTGAT 850 

TCCAAGTCTC GACTTCTGGA TATCGATGTT GTCGGAAAGG AGCACTACGA 900 

TGTTGCTTCC AACGTCCAGC AGACCCTCCA GGCTTACAAG TCTCTCCAGG 950 

ATATCATTGC CATTCTTGGT ATGGATGAGC TGTCCGAGCA GGACAAGCTG 1000

ACCGTCGAGC GAGCTCGAAA GATCCAGCGA TTCCTGTCTC AGCCCTTCAC 1050 

CGTCGCCGAG GTTTTCACCG GTATTGAGGG ACGACTTGTC TCTCTCAAGG 1100 

ACACTGTCCG ATCCTTCAAG GAGATCCTTG ACGGTAAGCA CGATGCTC 1148

2) INFORMATION FOR SEQ ID NO: 498 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 966 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: WSA-172 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498 



GCGCTATTGT CGTTGTTGCT GCCTCCGACG GTCAGATGTA GGTGGAACAT 50

CTTGGGAAAT ACGTCGTAAA ACACGTCGCT TACGTTTTCG CGAATAGGCC 100

CCAGACTCGT GAGCATTTGC TGCTCGCCCG CCAGGTTGGT GTCCAGAAGA 150

TCGTTGTCTT CGTCAACAAA ATCGATGCTA TTGATGATCC GGAGATGCTG 200

GAACTGGTCG AACTCGAGAT GCGTGAGCTG CTGAACAGCT ACGGTTTCGA 250

GGGTGAAGAG ACTCCGATCA TTTTCGGTTC CGCTCTCTGT GCTCTCGAAG 300

GACGCCGTGA CGACATCGGT AAAGACAGAA TTGAGCAGCT TATGAACGCT 350

GTCGACACCT GGATCCCCAC TCCTCAGCGT GACCTCGACA AACCTTTCTT 400

GATGTCTGTC GAGGAAGTGT TCTCTATCGC CGGCCGTGGT ACCGTGGCTT 450

CTGGTCGTGT CGAGCGTGGT ATCTTGAAGA AGGACTCTGA GGTTGAGATT 500

GTTGGAGGCT CCTTCGAACC CAAGAAGACC AAAGTCACCG ACATTGAAAC 550

CTTCAAGAAG AGCTGTGATG AATCGCGTGC TGGTGACAAC TCTGGTCTCC 600

TCCTGCGTGG TATCCGACGT GAAGACGTCA AGCGTGGTAT GGTCATTGCT 650

GTTCCCGGCA GCACCAAGGC TCACGACAAG TTCCTCGTCT CCATGTACGT 700

CCTGACCGAG GCGGAGGGTG GTCGTCGTAC TGGCTTCGGT GCCAACTACC 750

GTCCCCAAGT CTTCATCCGT ACTGCAGGTA AGTTCCCGCA CACCGTGTCC 800

AGATCTTCCG AGAGATTAGC GATATATGCT AATGATTCAT CAGACGAGGC 850

TGCTGACCTC AGCTTCCCTG ACGGCGACCA ATCTCGCAGA GTTATGCCTG 900

GTGACAACGT CGAGATGATC CTGAAGACCC ACCACCCTGT TGCTGCTGAG 950

GCTGGTCAAC GCTTCA 966



2) INFORMATION FOR SEQ ID NO: 499 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 

(B) STRAIN: ATCC 10663 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499 

TGGTGCTATT ATTGTTGTTG CTGCTTCTGA TGGTCAAATG CCCCAAACCC 50 

GTGAGCACTT GCTTCTTGCT CGTCAAGTTG GTGTTAAGCA CATTGTTGTT 100 

TTCGTTAACA AGATTGATAC TATTGATGAT CCTGAAATGT TGGAACTTGT 150 

CGAAATGGAA ATGAGAGAAC TTCTTTCTTC TTACGGTTTT GATGGCGATA 200 

ACACCCCTGT CATTATGGGT TCTGCTCTCT GTGCTCTTGA AGGTCGTGAA 250 

CCAGAAATTG GTGAACAAAG AATCAACCAA CTCCTTGATG CTATCGATGA 300 

ATACATTCCT ACCCCAGTTC GTGATATGGA CCAACCTTTC TTGATGCCAC 350 

TTGAAGGTGT TTTCTCTATT CCAGGTCGTG GTACTGTTGC CACTGGACGT 400

GTCTATCGTG GTACTTTGAA GAGAGGTGAA GAAGTTGAAG TTGTTGGCTA 450

CAATGATGCT CCAATCAAGA CCACCGTTAC TGGTATTGAA ATGTTCAAGA 500 

AGGAACTTGA TCAAGCTCAA GCTGGTGACA ACGCTGGTAT TCTTTTGAGA 550 

GGTGTTAAGC GTGAAGACCT TAAGCGTGGT ATGGTTGTTG CTAAACCAGG 600 

TACCGTTAAG CCACACACCA AGTTCCTTGC CTCCATCTAT GTTTTGACTA 650

AGGAAGAAGG TGGCAGACAC TCTGGCTTTG GTCTTAACTA CAGACCTCAA 700 

CTTTTCCTTG GTTCTGCTGA TGTTACCACT GTCTTGACCT TCCCAGAGGG 750 

TGTTGACCAA AGCACTCAAG TCATGCCAGG TGACAACACT GAAATGGTTT 800 

GCGAACTTGT TCACCCAGTT GCTGTCGAAC AAGGCCAACG TTTCAA 846



2) INFORMATION FOR SEQ ID NO: 500 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 



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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500 
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GGTGCCATTA TTGTTGTTGC TGCCTCTGAT GGACAGATGC CCCAGACCCG 50 

TGAGCATCTT TTGCTTGCCC GCCAAGTCGG TATGCAAAAG GTCGTTGTGT 100 

TTGTTAACAA GATTGATACC ATTGATGACC CTGAAATGCT TGATCTTGTC 150

GAGATGGAGA TGCGTGAACT GTTGAATGAA TATGACTTCG ATGGAGATAA 200 

CTCTCCTGTC ATTATGGGCT CTGCTCTTGC TGCTCTTGAG GACAAGAACC 250 

CCGAGATTGG TAAGGACCGT ATCATGCAGC TCTTGGACGC TGTTGATGAA 300 

TGGATCCCTA CCCCCGAGCG TGACCTTGAC AAGCCTTTCA TGATGCCTAT 350 

TGAGGCCTCT TTCTCCATTT CTGGTCGTGG TACTGTTGCC ACTGGCCGTG 400

TCGAGCGTGG TATTCTCAAG AAGGGTGAGG AAGTCGAGAT CGTTGGTTTC 450 

AACAAGCAGC CCCTGAAATC TGTTGTTACT GGTATTGAAA TGTTCAAGAA 500 

GGAACTTGAT CAGGCCCAGG GCGGTGATAA TGCTGGTATC TTGCTTCGTG 550 

GTATTCGTCG TGAGGACTTG CAGCGTGGTA TGGTTTTGGC CAAGCCTGGA 600 

ACTGTTAAGG CTCACACCAA GTTCCTTTCC TCCATCTACG TTCTCTCCAA 650

GGAAGAGGGC GGCCGTCACT CTCCTTTCGG TATGAACTAT CGTCCCCAGA 700 

TGTTCGTTTC TGCAGCTGAT GTCACCGTTA CTCTTACTTT CCCTGAGGGT 750 

GTTGAACAGC ACACTCAGGX CTTCCCTGGT GAGAACACCG AGATGGTTGG 800 

CGAGCTCGTT CACCCTACTG CTATTGAGGT TGGTCAACGC TTCAAC 846



2) INFORMATION FOR SEQ ID NO: 501 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 944 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Coccidioides immitis

(B) STRAIN: Silveira

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501

AGTTGTCGTC GTTGCTGCTT CAGACGGTCA AATGTATGCA ACCGAGAGCA 50 

CTCCCGGATC TTGGTTTAAA TGGCACTAAT ATAAGACAGG CCTCAAACTC 100

GAGAGCATTT ACTTCTCGCC CGTCAGATCG GTATCCAAAA AATCGTCGTC 150 

TTCGTGAACA AGGTTGATGC CATCGAGGAC AAAGAGATGT TGGAGCTTGT 200

TGAATTGGAG ATGCGTGAAC TCCTAACCAG CTACGGTTTC GAGGGTGAAG 250 

AAACTCCCAT CATTTTTGGC TCTGCTCTCT GTGCCCTCGA AGGAAGACAA 300 

CCCGAGATCG GTGTTACCAA GATTGATGAG CTCTTGCAGG CCGTCGACAC 350

CTGGATTCCC ACTCCTCAGC GTGAGACTGA CAAGCCCTTC TTGATGTCCA 400

TTGAGGAAGT GTTCTCTATT TCCGGACGAG GAACCGTTGT CTCCGGCCGT 450

GTGGAGCGTG GTATCCTCAA GAAGGACTCC GAAGTTGAAA TTGTCGGCGG 500 

TTCGCCCGAG CCAATCAAAA CCAAGGTTAC CGATATCGAA ACCTTTAAGA 550 

AGTCTTGCGA CGAGTCTCGC GCTGGTGATA ACTCCGGCTT GCTCCTACGA 600

GGCGTTAAGC GTGAAGATAT TAGCCGTGGC ATGGTCGTCG CTGTACCAGG 650 

AAGTGTCAAG GCCCATACTG AATTCTTAGT TTCGCTTTAC GTCCTCACCG 700 

AAGCTGAGGG TGGGCGCAAA TCTGGATTCA GCAGCAAGTA CCGCCCACAG 750 

ATGTTCATTC GCACTGCCGG TATGTAATAC TGTGATAATT TCGTTGACAT 800 

GGTACTGATT GAATTCTATA GACGAAGCGG CTCAGCTCAG CTGGCCCGGA 850

GAAGATCAAG ACAAGATGGC TATGCCAGGA GACAATATCG AAATGATTTG 900 

CACCACCTTG CACCCAGTTG CCGCCGAGGC TGGCCAGCGA TTCA 944 



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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 849 bases
(B) TYPE: Nucleic acid

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: WSA-212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502 

GCTATCATCG TCGTTGCTGC CTCCGATGGA CAGATGCCCC AGACCCGTGA 50 

ACACTTGCTC CTCGCTCGTC AGGTCGGTGT CCAGCGAATT GTCGTCTTCG 100 

TCAACAAGGT CGATGCCATT GATGACCCCG AGATGCTTGA GCTCGTCGAG 150 

ATGGAGATGC GCGAGCTTCT TAACACCTAC GGCTTCGAAG GCGACGACAC 200 

TCCCGTCATC ATGGGCTCTG CTCTTATGTC TCTCCAGAAC CAGCGCCCCG 250 

AGATTGGCAC CGAGAAGATC GATGAGCTCC TTGCTGCCGT CGACGAGTGG 300 

ATCCCAACCC CCGAGCGTGA CCTTGACAAG CCCTTCCTTA TGTCCGTCGA 350 

GGATGTCTTC TCCATTGCCG GCCGTGGTAC CGTCGTGTCT GGCCGTGTGG 400

AGCGTGGTAT TCTGAAGCGT GATCAGGAGA TCGAGCTTGT CGGAAAGGGG 450

CAGGAGGTTA TCAAGACCAA GGTTACCGAC ATTGAGACCT TCAAGAAGTC 500

TTGTGAGCAG TCCCAGGCTG GTGACAACTC TGGTCTCCTC ATCCGAGGTG 550 

TTCGCCGTGA GGATGTCCGC CGTGGTATGG TTGTCTGCGC TCCTGGCACC 600 

GTCAAGTCTC ATACCCAGTT TCTCGCTTCC CTCTACGTCC TCACCAAGGA 650 

GGAGGGTGGC CGACACACCG GTTTCCAGGA GCACTACCGA CCCCAGCTCT 700 

ATCTCCGAAC TGCAGATGAG TCCATTGACC TGACTTTCCC CGAGGGTACT 750 

GAGGATGCCT CCAGTAAGAT GGTCATGCCT GGTGACAACA CCGAGATGGT 800 

TGTCACCATG GGTCACCCCA ATGCCATCGA GGTTGGTCAG CGATTCAAC 849



2) INFORMATION FOR SEQ ID NO: 503 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1064 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum

(B) STRAIN: G186A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503 

TGGTGCTATT GTCGTTGTTG CTGCTGCTGA CGGCCAAATG TAAGACGCCG 50 

CGAGGGACTG CTGAGGGTTT TATGCTTTTT AGGCCCCCTT GTTTCTGAGA 100 

GCATGATGAT ACTAATATTC GGAAACGTAT CTATTAGGCC TCAAACACGT 150 

GAGCATTTGC TCCTTGCCCG ACAGGTCGGT GTCCAAAAGA TCGTCGTTTT 200 

CGTGAACAAA GTCGACGCCC TTGAGGACAA GGAGATGTTG GAGCTTGTCG 250

AGTTAGAAAT GAGAGAGCTC TTAAACACCT ACGGCTTCGA GGGTGAAGAG 300 

ACACCCATCA TCTTTGGTTC TGCCCTTTGC GCCATGGAAG GCCGTGAGCC 350 

TGAGTTGGGA GAAAAGAAAA TTGATGAATT GCTGGAGGCT GTTGATACTT 400

GGATCCCCAC ACCACAACGT GATACCGAAA AACCTTTCTT GATGTCCGTT 450

GAGGAAGTAT TCTCTATCTC CGGCCGTGGA ACCGTTGCCT CCGGTCGTGT 500

TGAGCGCGGT GTCCTCAAGA AGGATTCAGA AGTCGAGCTA ATTGGGGGCG 550

GCTCCACCCC CATCAGGACG AAGGTAACTG ATATCGAAAC TTTCAAGAAA 600

TCCTGTGACG AGTCTAGAGC TGGGGACAAC TCCGGTCTTT TATTGCGTGG 650

TATCAAGCGT GAAGATATCC GCCGTGGTAT GGTAGTTGCC GTTCCTGGCA 700

GCGTCAAGGC CCACGACAAG TTCTTGGTGT CGATGTATGT CCTGACCGAA 750

GCTGAGGGTG GTCGCCGAAC CGGATTCGGC CAGAACTATC GTCCTCAAAT 800

GTTCATCCGC ACAGCTGGTA TGTCAAAATG GGACCCCTTT TCATAATCCT 850

TTCTTTTTTT CCTTTTCCTC TCTATCTCTC TTTCTGTTTC CTTTCAACTC 900

GCCTGATTCA CGAAATTAAC TAACCCGTTT GATTATAGAC GAAGCCGCCC 950

ATCTCAGCTT CCCTAGTGGA GCAGATGAAA GCAAACTCGT TATGCCTGGT 1000

GACAACGTCG AGATGATCCT CCAGACACAC CGCCCCGTGG CTGCTGAGGC 1050

CGGCCAGCGA TTCA 1064



2) INFORMATION FOR SEQ ID NO: 504 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 982 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 
(B) STRAIN: ATCC 32071

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504 

TGGTGCTATC GTCGTTGTTG CTGCCTCTGA CGGCCAAATG TAGGGATTTT 50 

GCAAGACTGG TGAAAAAATC TAAAGAAAAT AGAAAAGATT GTGCTGATGT 100

TTGGTATCAG GCCCCAAACA CGCGAGCATC TGCTTCTTGC CAGACAAGTC 150 

GGTGTTCAGA AAATCGTTGT TTTCGTCAAC AAGGTCGATG CTGTAGAGGA 200 

TAAGGAGATG TTGGAGCTTG TCGAATTGGA GATGAGAGAG CTCTTGACCA 250 

CCTATGGGTT CGAGGGTGAG AAGACACCTA TCATCTTTGG TTCTGCGCTC 300 

TGTGCTATGG AGGGCCGTCA GCCCGAGTTG GGAGAGCAGA AAATTGATGA 350

ATTACTCGAG GCTGTGGATA CTTGGATCCC TACGCCACAG CGTGATACTG 400

ACAAGCCCTT CCTGATGTCC ATTGAGGAGG TGTTCTCTAT CTCTGGACGA 450 

GGAACCGTTG CCTCCGGCCG CGTTGAGCGT GGTATCCTCA AGAAGGACTC 500 

CGAAGTTGAA ATTATTGGCG GCGGTGTTCC CACAATCCTG ACCAAGGTGA 550 

CTGATATCGA AACCTTCAAG AAGTCTTGCG ACGAGTCCAG AGCCGGGGAC 600

AACTCCGGCC TCTTGTTGCG CGGTGTCAAG CGTGAGGATA TCCGCCGTGG 650 

TATGGTCGTT GCAGTTCCCG GAAGCGTCAA AGCACATGAC AGATTCTTGG 700 

TGTCGATGTA CGTTCTGACC GAGGCTGAGG GTGGTCGCCG CACTGGCTTC 750 

GGTCAGAACT ATCGTCCTCA AATGTTCATC CGCACAGCTG GTACGTTCAT 800 

TCTTTCACTA TATTCCTATA TGCATAGCCC GATCCTCCCA TTAACTAATT 850

GACACAGACG AGGCTGCTGA ACTCAGCTGG CCTGATGGAG ACGACGAAGC 900 

CAAAATGGTC ATGCCCGGTG ACAATGTTGA AATGGTCCTG AAGTCACACC 950 

GCCCGGGGCC GCTGAGGCTG GACAGCGATT CA 982 


2) INFORMATION FOR SEQ ID NO: 505 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 931 bases

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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 58950 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505 

CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 100 

CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 150

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 200

TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 250

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 300 

GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGCCGTT GATACCTGGA 350 

TCCCCACCCC CCAGCGTGAC CTTGACAAGC CCTTCTTGAT GTCCGTTGAG 400

GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 450 

GCGTGGTATT TTGCGCAAGG ATTCTGAGGT TGAGATTATC GGATACCAGA 500 

AGAACCCTAT CAAGACCAAG GTTACCGACA TTGAGACCTT CAAGAAGTCT 550 

TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 600 

CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 650

CCAAGGCTCA TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 700 

GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 750 

CATCCGTACT GCCGGTATGT TCCCTTTCAA AGTCAATTAA TGAGCGATTT 800 

GCTAACGAGT TATAGATGAG GCTGCTACTC TCAGCTTCCC CGGTGACGAT 850 

CAGTCCAAGC AGGTCATGCC CGGTGACAAC GTTGAGATGA TCTTGAAGAC 900

ACACCGTCCC GTTGCCGCCG AAGCTGGTCA G 931 



2) INFORMATION FOR SEQ ID NO: 506

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 846 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 






(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 18205 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506 



TGGTGCTATT ATTGTTGTTG CTGCTTCTGA TGGTCAAATG CCTCAAACCA 50

GAGAACATTT RTTATTGGCT AGACAAGTTG GTGTTCAACA CATTGTTGTC 100

TTTGTTAACA AAGTTGATAC TATTGATGAC CCAGAAATGT TGGAATTAGT 150

TGAAATGGAA ATGAGAGAAT TGTTAAGTAC TTATGGTTTT GATGGTGATA 200

ACGTCCCAGT TGTTATGGGT TCTGCTTTAT GTGCCTTGGA AGGTCGTGAA 250

GAAGAAATTG GTGTCAAAGC TATTGATAAA TTATTAGCTG CTGTTGATGA 300

ATATATCCCA ACCCCACAAA GAGATTTAGA AAAACCATTC TTGATGGGTG 350

TTGAAGATGT CTTYTCAATC TCAGGTAGAG GTACCGTTGT TACTGGTCGT 400

GTTGAACGTG GTAACTTGAA GAAAGGTGAT GAAGTTGAAA TTGTTGGTTT 450

AAACAAAACT CCATTGAAAA CTACTGTYAC NGGTATTGAA ATGTTCAAAA 500

AAGAATTGGA CCAAGCTATG GCTGGTGATA ACTGTGGTAT CTTATTACGT 550

GGTATCAAAA GAGATGACAT YAAAAGAGGT ATGGTTATTG CTAAAACCGG 600

TACCATCTCW GCTCACACTA AATTCTTAGC CTCAATGTAT ATTTTGACTA 650

AAGAAGAAGG TGGTCGTCAC TCAGGTTTTG GTGAACATTA CAGACCTCAA 700

TTATTCATCA GAACTGGTGA TGTTACCGTT GTTTTAACCT TYCCAGAAGG 750

TGGTGATTCA TCTCAACAAA TCTTACCAGG TGACAATGTC GAAATGGTTT 800

GTGAATTGGT TCACCCAACT GCTTTAGAAG CTGGTCAAAG ATTCAA 846






2) INFORMATION FOR SEQ ID NO: 507 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 964 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: WSA-225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507 

GGAGCTGTTG TCGTTGTCGC AGCTTCTGAC GGTCAAATGT AATTGAATGC 50 

CCGCCCAGAC GGATGAAAGG ATTTGACGTT TCTAACATCA GTCTAGGCCT 100 

CAGACCAGAG AACATTTGCT CCTTGCCCGC CAGGTCGGTG TCCAGAAGCT 150 

GGTCGTTTTC GTTAACAAGG TCGATGCCGT TGAGGACCCA GAGATGTTGG 200 

AGCTTGTCGA ACTTGAAATG CGTGAACTCC TCAGCCACTA CAGTTTTGAG 250 

GGTGAGGAGA CCCCCATCAT TTTTGGCTCT GCTCTCTGTG CCCTCGAGTC 300 

CCGTCGACCT GAGCTTGGTG TCGAGAAGAT TGACGAGCTA TTGAACGCCG 350 

TCGACACCTG GATCCCCACC CCCGAGCGCG CCACTGATAA GCCTTTCCTC 400

ATGTCCATTG AGGAAGTGTT CTCTATCTCT GGTCGTGGTA CCGTCGTCTC 450 

CGGTCGTGTT GAGCGTGGTA TCCTCAAGAA GGATTCCGAC GTCGAAATTG 500 

TTGGTGGCTC TACCACCCCT ATCAAGACCA AGGTCACAGA TATCGAAACC 550 

TTCAAGAAGT CCTGCGATGA ATCTCGAGCT GGTGACAACT CTGGTCTCCT 600

TCTCCGAGGT ATCAAGCGTG AGGACTTGAA GCGTGGAATG GTTGTTGCTG 650 

CCCCCGGATC CACCAAGGCT CACACCGACT TCATGGTCTC CCTCTACGTC 700 

CTGACTGAGG CTGAGGGTGG TCGTTCCAAC GGCTTCACCC ACAAGTACCG 750 

CCCTCAAATG TTCATCCGTA CTGCTGGTAT GTAACCAAAG TTTCCGCTAT 800 

TTACTAAGTA GATCATTGCT AACTTGTATT CCCTTCCGTA GACGAAGCCG 850 

CATCTTTCAG CTGGCCTGGA GAAGACCAAG ACAAGAAGGC TATGCCTGGT 900 

GACAACGTCG AGATGATTTG CAAAACCCTC CACCCCATTG CTGCCGAGGC 950 

TGGCCAACGA TTCA 964 



2) INFORMATION FOR SEQ ID NO: 508 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Yarrowia lipolytica

(B) STRAIN: ATCC 38295

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508



GGTGCTATCA TTGTCGTTGC TGCTGGAGAC GGTTCCAAGC CCCAGACCCG 50 

AGAGCATCTG CTGCTTGCTC GACAGGTCGG TGTCCAGAAC CTGGTTGTGT 100 

TTGTTAACAA GGTTGATCAG ATTGATGATA AGGAGATTCT TGAGCTCGTT 150 

GACATGGAGA TGCGAGATCT GCTGACCCAG TACGGTTTTG ATGGTGACAA 200 

CACCCCCGTT GTCATGGGCT CTGCTCTGTG CGCTCTTGAG GGCAAGCAGA 250

AGGATATTGG AGAGGACGCC ATCATGGCCC TTATGGATGC CGTTGATGAG 300 

CACATCCCTA CCCCTAACCG TGACCTTGAG AAGCCCTTCC TGATGCCCGT 350 

TGAGGACGTT TTCTCCATCT CTGGCCGAGG AACTGTTGTT ACTGGCCGAG 400 

TCGAGCGAGG AAACCTGAAG AAGGGTGAGG AAATCGAGAT TGTTGGCTAC 450

AACAACAAGC CCATCAAGGC TGTTGTTACC GGTATTGAGA TGTTCAAGAA 500

GGAGCTCGAG TCCGCCATGG CCGGTGACAA CGCCGGTATC CTGCTCCGAG 550 

GTATCAAGCG AGACGAGATC AAGCGAGGTA TGGTCATGTG CAAGCCTGGC 600 

ACCGTCAACG CCCACACCAA GTTCCTTGCT TCTCTTTACA TCATCCCCAC 650 

CGAGGAGGGT GGTCGAACCA GCTCTTTCGG CGCCAACTAC CGACCCCAGA 700 

TGTTCATCCG AACTTCTTCC GTCACCGCCA CTCTCACCTT CCCCGAGGGT 750

ACCGACGAGT CCCAGACCGT CAACCCCGGT GACAACACTG AGATGGTTCT 800 

CGAGCTTGTT CACCCTACCG CCATTGAGGT CAACCAGCGA TTCA 844 






2) INFORMATION FOR SEQ ID NO: 509 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1067 bases
(B) TYPE: Nucleic acid

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bigemina 

(B) STRAIN: Suarez-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509

CTTGGACAAG CTGAAGAGCG AGCGTGAGCG TGGTATCACC ATTGACATTA 50 

CCCTGTGGAA GTTCGAAACT GGCAAGTACT ACTACACCGT CATTGACGCC 100 

CCCGGTCACC GTGACTTCAT TAAGAACATG ATTACGGGTA CCTCCCAGGC 150 

CGATGTTGCT ATGCTTGTCG TGCCCGCCGA GGCTGGTGGT TTCGAAGCTG 200

CCTTCTCTAA GGAAGGTCAG ACCCGTGAGC ACGCTCTTTT GGCCTTCACC 250 

CTTGGTGTCA AGCAGATCAT TTGCGCCATC AACAAGATGG ACAAGTGCGA 300 

CTACAAGGAG GACCGTTACA GCGAAATCCA GAAGGAAGTT CAGGGTTACC 350 

TGAAGAAGGT CGGTTACAAC ATCGAGAAGG TGCCTTTCGT CGCCATCTCC 400

GGTTTCATGG GTGACAACAT GGTTGAGCGC TCCACCAACA TGCCGTGGTA 450

CAAGGGCAAG ACCTTGGTCG AGGCCCTCGA CATGATGGAG CCCCCGAAGA 500 

GGCCCGTCGA CAAGCCCCTG CGTCTTCCCC TCCAGGGTGT GTACAAGATC 550 

GGTGGTATCG GTACCGTCCC TGTCGGTCGT GTGGAGACTG GTCAGCTCAA 600 

GGCCGGTATG GTCCTCACCT TCGCCCCCAA CCCGATCACT ACTGAGTGCA 650 

AATCCGTCGA AATGCACCAC GAAGTTATCG ATGTTGCCAG CCCTGGTGAC 700

AACGTTGGTT TCAACGTGAA GAACGTGTCC ACCTCTGACA TCCGCACTGG 750 

TCACGTCGCT TCTGACTCCA AGAACGACCC CGCCAAGGCC GCCGTGTCCT 800 

TCACCGCCCA GGTCATCATC TTGAACCACC CTGGTACCAT CAAGGCCGGT 850 

TACTCCCCTG TGGTTGACTG CCACACTGCC CACATCTCGT GCAAATTCGA 900 

CGAGATCACC AGCCGTATGG ACAAGCGTAC CGGTAAGGCC CTTGAGGAGA 950

ACCCCAAGAC CATCAAGAAC GGCGACGCCG CTATGGTCGT CCTGAAGCCG 1000

TGCAAGCCCA TGGTCGTCGA GGCCTTCACT GAATACGCTC CCCTTGGTCG 1050

TTTCGCCGTG CGTGACG 1067






2) INFORMATION FOR SEQ ID NO: 510 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis

(B) STRAIN: Suarez-3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510 



GTGAACGTGA ACGTGGTATT ACTATTGATA TTACCTTATG GAAGTTCGAG 50

ACCACCAAGT ACTACTACAC CGTCATTGAT GCCCCTGGTC ACCGTGACTT 100

CATCAAGAAC ATGATTACGG GTACTTCTCA AGCCGATGTT GCTATGCTTG 150

TTGTACCAGC TGAGGCTGGT GGTTTCGAGG CCGCTTTCTC CAAGGAAGGA 200

CAGACCCGTG AGCACGCTCT TTTGGCTTTC ACCCTTGGTG TCAAACAGAT 250

CATCTGTGCC ATTAACAAGA TGGACAAGTG CGACTACAAG GAGGACCGTT 300

ACAGTGAAAT CCAGAAGGAA GTCCAGGGTT ACCTCAAGAA GGTCGGTTAC 350

AATATTGAGA AGGTGCCCTT CGTTGCCATC TCCGGTTTCA TGGGAGACAA 400

CATGGTTGAG CGTTCCACCA ACATGCCCTG GTATAAGGGA AAGACATTGG 450

TCGAGGCCCT TGATCAGATG GAACCCCCAA AGAGGCCCGT TGACAAGCCA 500

CTTCGTCTTC CCCTCCAGGG TGTCTACAAG ATCGGTGGTA TCGGTACCGT 550

CCCCGTCGGT CGTGTTGAAA CTGGTATGTT GAAGGCTGGT ATGATTCTAA 600

CCTTTGCTCC TAACCCAATC ACCACTGAAT GCAAATCCGT TGAAATGCAC 650

CACGAAACCG TTGAGGTTGC TTACCCCGGT GACAACGTCG GTTTCAACGT 700

AAAGAACGTT TCTACTTCTG ACATTCGCAG TGGTCACGTT GCCTCTGATT 750

CTAAGAACGA CCCTGCCAAG GCTGCTGTTT CCTTCACTGC CCAGGTCATT 800

GTGCTCAACC ACCCTGGTAC CATTAAGGCC GGTTACTGCC CCGTCGTCGA 850

TTGCCACACC GCTCACATTT CATGTAAATT CGAAGAGATC ACCAGCCGTA 900

TGGACAAGCG TACCGGTAAA TCTCTTGAGG AAAACCCCAA GACCATCAAG 950

AACGGTGACG CTGCCATGGT TGTGCTCAAG CCAATGAAGC CCATGGTTGT 1000

CGAATCCTTC ACTGAGTATG CTCCTCTTGG TCGTTTCGCT GTTCGTGAC 1049






2) INFORMATION FOR SEQ ID NO: 511 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1070 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Crithidia fasciculata 

(B) STRAIN: ATCC 11745 



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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511 



TGGACAAGCT GAAGGCGGAG CGCGAGCGCG GTATCACGAT CGATATTGCC 50

CTGTGGAAGT TCGAGTCGCC CAAGTCCGTG TTCACGATCA TCGATGCCCC 100

CGGCCACCGC GACTTCATCA AGAACATGAT CACCGGCACC TCCCAGGCCG 150

ATGCCGCCAT TCTGATGATT GACTCGACCC AGGGTGGCTT CGAGGCTGGC 200

ATCTCCAAGG ACGGCCAGAC CCGCGAGCAC GCCCTGCTTG CCTTCACGCT 250

GGGCGTGAAG CAGATGGTTG TGTGCTGCAA CAAGATGGAC GACAAGACGG 300

TGCAGTACGC CCAGGCCCGC TACGAGGAGA TCAGCAAGGA GGTCGGCGCG 350

TACCTGAAGC GCGTGGGCTA CAACCCGGAG AAGGTGCGCT TCATCCCGAT 400

CTCGGGCTGG CAGGGCGACA ACATGATCGA GAAGTCCGAC AACATGTCGT 450

GGTACAAGGG TCCCACGCTG CTGGAGGCGC TCGACCTGCT GGAGGCCCCC 500

GTGCGTCCGG TGGACAAGCC GCTGCGCCTG CCCCTGCAGG ACGTGTACAA 550

GATCGGCGGT ATCGGCACTG TGCCCGTGGG CCGTGTGGAG ACCGGCGTGA 600

TGAAGCCGGG CGACGTTGTG GTGTTCGCGC CTGCCAACGT GACGACCGAG 650

GTGAAGTCGA TCGAGATGCA CCACGAGCAG CTGGCTGAGG CCGTGCCCGG 700

CGACAACGTG GGCTTCAACG TGAAGAACGT GTCCGTGAAG GATATCCGCC 750

GTGGTAACGT GTGCGGCAAC ACGAAGAGCG ACCCCCCGAA GGAGGCGGCC 800

GACTTCACTG CCCAGGTGAT CGTGCTGAAC CACCCCGGCC AGATCAGCAA 850

CGGCTACGCG CCGGTGCTGG ACTGCCACAC GAGCCACATC GCGTGCAAGT 900

TCGCGGACAT CGAGTCCAAG ATCGACCGCC GCTCTGGCAA GGAGCTGGAG 950

AAGAGCCCGA AGGCCATCAA GTCCGGCGAT GCGGCCATCG TGAAGATGAT 1000

CCCGCAGAAG CCGATGTGCG TGGAGGTGTT CAACGACTAC CCGCCGCTGG 1050

GCCGCTTCGC TGTCCGCGAT 1070



2) INFORMATION FOR SEQ ID NO: 512 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1052 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entamoeba histolytica
(B) STRAIN: HM1-IMMS

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512 

GCTGAAAGAG AAAGAGGAAT TACTATTGAT ATTTCATTAT GGAAATTCGA 50 

AACATCTAAA TACTACTTCA CTATTATTGA TGCCCCAGGT CAYAGAGATT 100

TCATTAAGAA CATGATTACT GGAACTTCAC AAGCTGATGT TGCCATCCTT 150 

ATTGTTGCTG CTGGTACTGG WGAATTTGAA GCTGGTATTT CAAAGAATGG 200 

ACAAACCAGA GAACAYATTC TTCTTTCATA CACTCTTGGA GTTAAACAAA 250 

TGATTGTTGG WGTTAACAAG ATGGATGCTA TTCAATATAA ACAAGAAAGA 300 

TATGAAGAAA TTAAGAAAGA AATTAGTGCA TTCCTTAAGA AGACWGGATA 350

TAATCCAGAC AAGATTCCAT TTGTCCCAAT TTCAGGATTC CAAGGAGATA 400 

ATATGATTGA ACCATCAACC AACATGCCAT GGTACAAAGG ACCAACATTA 450

ATTGGAGCAC TTGATTCAGT CACACCACCA GAAAGACCAG TTGATAAACC 500 

ACTTAGACTT CCACTTCAAG ATGTTTAYAA GATTTCAGGT ATTGGAACTG 550 

TACCATGTGG AAGAGTTGAA ACTGGAGTTC TTAGACCAGG AACTATTGTT 600

CAATTTGCAC CATCAGGAGT TTCATCTGAA TGTAAATCAG TTGAAATGCA 650 

TCACACAGCA CTTGCTCAAG CTATTCCAGG TGATAATGTT GGATTCAATG 700 

TTAGAAAYTT AACAGTTAAA GATATTAAGA GAGGAAATGT AGCATCAGAT 750 

GCTAAGAATC AACCAGCTGT TGGATGTGAA GATTTCACTG CTCAAGTCAT 800 

TGTCATGAAC CATCCAGGAC AAATTAGAAA GGGATATACA CCAGTTCTTG 850

ATTGCCATAC ATCACACATT GCATGTAAAT TCGAAGAATT ATTA^GCAAG 900

ATTGATAGAA GAACAGGTAA ATCCATGGAA GGAGGAGAAC CAGAATATAT 950

TAAGAATGGA GATTCAGCAC TTGTTAAGAT TGTTCCAACT AAACCACTTT 1000

GTGTTGAAGA ATTTGCTAAA TTCCCACCAT TGGGAAGATT TGCTGTTAGA 1050

GA 1052






2) INFORMATION FOR SEQ ID NO: 513 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1082 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Giardia lamblia

(B) STRAIN: Faubert-1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513 



GACGAGCGCG AGCGCGGGAT CACGATCAAC ATCGCGCTCT GGAAGTTCGA 50

GACGAAGAAG TACATCGTCA CGATCATCGA CGCCCCGGGC CACCGCGACT 100

TCATCAAGAA CATGATCACG GGGACGTCCC AGGCCGACGT CGCGATCCTC 150

GTCGTCGCGG CGGGCCAGGG CGAGTTCGAG GCCGGGATCT CGAAGGACGG 200

CCAGACGCGC GAGCACGCGA CCCTTGCGAA CACGCTCGGG ATCAAGACGA 250

TGATCATCTG CGTCAACAAG ATGGACGACG GCCAGGTCAA GTACTCGAAG 300

GAGCGCTACG ACGAGATCAA GGGCGAGATG ATGAAGCAGC TCAAGAACAT 350

CGGCTGGAAG AAGGCCGAGG AGTTCGACTA CATCCCGACG TCCGGCTGGA 400

CCGGGGACAA CATCATGGAG AAGTCCGACA AGATGCCCTG GTACGAGGGC 450

CCGTGCCTGA TCGACGCGAT CGACGGGCTC AAGGCCCCGA AGCGCCCGAC 500

CGACAAGCCC CTCCGCCTCC CGATCCAGGA CGTCTACAAG ATCTCGGGCG 550

TCGGGACCGT CCCCGCGGGC CGCGTCGAGA CGGGCGAGCT CGCGCCCGGG 600

ATGAAGGTCG TCTTCGCCCC GACGTCCCAG GTCTCGGAGG TCAAGTCCGT 650

CGAGATGCAC CACGAGGAGC TCAAGAAGGC CGGGCCCGGG GACAACGTCG 700

GCTTCAACGT CCGCGGGCTC GCCGTCAAGG ACCTCAAGAA GGGCTACGTC 750

GTCGGGGACG TGACGAACGA CCCGCCCGTC GGCTGCAAGA GCTTCACCGC 800

CCAGGTCATC GTCATGAACC ACCCGAAGAA GATCCAGCCC GGCTACACGC 850

CCGTCATCGA CTGCCACACC GCGCACATCG CGTGCCAGTT CCAGCTCTTC 900

CTCCAGAAGC TCGACAAGCG CACGCTCAAG CCCGAGATGG AGAACCCGCC 950

CGACGCAGGC CGCGGCGATT GCATCATCGT CAAGATGGTC CCCCAGAAGC 1000

CCCTGTGCTG CGAGACGTTC AACGACTACG CGCCCCTCGG CCGCTTCGCC 1050

GTCCGCGACA TGCGCCAAAC CGTTGCCGTC GG 1082



2) INFORMATION FOR SEQ ID NO: 514

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:

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(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514 

ACGCGTGGGT GCTCGACAAG CTGAAGGCGG AGCGCGAGCG CGGCATCACG 50 

ATCGACATTG CGCTGTGGAA GTTCGAGTCG CCCAAGTCCG TGTTCACGAT 100 

CATCGATGCG CCCGGCCACC GCGACTTCAT CAAGAACATG ATCACGGGCA 150 

CGTCGCAGGC GGACGCCGCC ATCCTGATGA TCGACTCGAC GCATGGTGGC 200 

TTCGAGGCTG GCATCTCGAA GGACGGCCAG ACCCGCGAGC ACGCGCTGCT 250 

TGCCTTCACT CTTGGCGTGA AGCAGATGGT GGTGTGCTGC AACAAGATGG 300 

ACGACAAGAC GGTGACGTAC GCGCAGTCGC GCTACGATGA GATCAGCAAG 350 

GAGGTGGGCG CGTACCTGAA GCGCGTGGGC TACAACCCGG AGAAGGTGCG 400

CTTCATCCCG ATCTCGGGCT GGCAGGGCGA CAACATGATC GAGAAGTCGG 450

ACAACATGCC GTGGTACAAG GGTCCCACGC TGCTGGACGC GCTCGACATG 500

CTGGAGCCGC CGGTGCGCCC GGTGGACAAG CCGCTGCGCC TGCCCCTGCA 550 

GGACGTGTAC AAGATCGGCG GTATCGGGAC GGTGCCCGTG GGGCGCGTGG 600 

AGACCGGCAT CATGAAGCCG GGCGACGTGG TGACGTTCGC GCCCGCCAAC 650 

GTGACGACTG AGGTGAAGTC GATCGAGATG CACCACGAGC AGCTGGCGGA 700 

GGCGCAGCCC GGCGACAACG TCGGCTTCAA CGTGAAGAAC GTGTCGGTGA 750 

AGGACATCCG CCGTGGTAAC GTGTGCGGCA ACTCGAAGAA CGACCCGCCG 800 

AAGGAGGCGG CCGACTTCAC GGCGCAGGTG ATCGTGCTGA ACCACCCCGG 850 

CCAGATCAGC AACGGCTACG CGCCGGTGCT GGACTGCCAC ACGAGCCACA 900 

TTGCGTGCCG CTTCGCGGAA ATCGAGTCCA AGATCGACCG CCGCTCCGGC 950 

AAGGAGCTGG AGAAGAACCC CAAGGCGATC AAGTCTGGCG ATGCCGCGAT 1000 

CGTGAAGATG GTGCCGCAGA AGCCGATGTG CGTGGAGGTG TTCAACGACT 1050 

ACGCGCCGCT GGGCCGCTTT GCCGTGCGCG ACATGCGCCA AACCGTTG 1098 



2) INFORMATION FOR SEQ ID NO: 515 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1104 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CGGTGACGTA CGCGCAGTCG CGCTACGAGG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750

AAGGACATCC GCCGTGGCAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100

CGTC 1104



2) INFORMATION FOR SEQ ID NO: 516 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1106 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 
(B) STRAIN: ATCC 30815

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC KCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CGGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGGCGCGTG 600

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100

CGTCGG 1106 



2) INFORMATION FOR SEQ ID NO: 517

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

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(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250

TTGCCTTCAC GCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CCGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAGGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CTGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGCAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTG 1099 



2) INFORMATION FOR SEQ ID NO: 518 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC GCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CCGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAGGTCG 450

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCSACAT 500

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550

AGGACGTGTA CAAGATCGGC GGTATCGGGA CTGTGCCCGT GGGCCGCGTG 600

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750

AAGGACATCC GCCGTGGCAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTT 1098






2) INFORMATION FOR SEQ ID NO: 519 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1071 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania enriettii 

(B) STRAIN: ATCC 50120 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519 



CTCGACAAGC TGAAGGCGGA GCGCGAGCGC GGCATCACGA TCGACATTGC 50 

GCTGTGGAAG TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGATGCGC 100 

CCGGCCACCG CGACTTCATC AAGAACATGA TCACCGGCAC GTCGCAGGCC 150

GACGCCGCCA TCCTGATGAT CGACTCGACC CAGGGCGGCT TCGAGGCTGG 200 

CATCTCGAAG GACGGCCAGA CCCGCGAGCA CGCGCTGCTC GCCTTCACGC 250 

TTGGTGTGAA GCAGATGGTG GTGTGCTGCA ACAAGATGGA CGACAAGACG 300 

GTGCAGTACT CGCAGGCGCG CTACGAGGAG ATCAGCAAGG AGGTGGGCGC 350 

GTACCTGAAG CGCGTCGGCT ACAACCCGGA GAAGGTGCGC TTCATCCCGA 400

TCTCGGGCTG GCAGGGCGAC AACATGATCG ACAAGTCGGA CAGCATGCCG 450 

TGGTACAAGG GGCCCACGCT GCTGGACGCG CTCGACATGC TGGAGGCGCC 500 

GGTGCGCCCG GTGGACAAGC CGCTGCGCCT GCCCCTGCAG GACGTGTACA 550 

AGATCGGCGG TATCGGGACG GTGCCCGTGG GCCGCGTGGA GACCGGCATC 600 

ATGAAGCCTG GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA 650

GGTGAAGTCG ATCGAGATGC ACCACGAGCA GCTGGCGGAG GCGGTGCCCG 700 

GCGACAACGT CGGCTTCAAC GTGAAGAACG TGTCGGTGAA GGACATCCGC 750 

CGTGGTAACG TGTGCGGCAA CTCGAAGAAC GACCCGCCGA AGGAGGCGGC 800 

CGACTTCACG GCGCAGGTGA TCGTGCTGAA CCACCCTGGC CAGATCAGCA 850 

ACGGCTACGC GCCGGTGCTG GACTGCCACA CGAGCCACAT CGCGTGCCGC 900

TTCGCGGATA TCGAGTCCAA GATCGACCGC CGCTCTGGCA AGGAGCTGGA 950 

GAAGAACCCC AAGGCGATCA AGTCCGGCGA TGCGGCCATC GTGAAGATGG 1000 

TGCCGCAGAA GCCGATGTGC GTGGAGGTGT TCAACGACTA CCCGCCGCTG 1050 

GGGCGCTTCG CTGTGCGCGA C 1071 

2) INFORMATION FOR SEQ ID NO: 520

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1071 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 
(B) STRAIN: ATCC 50121

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520 

CTCGACAAGC TGAAGGCGGA GCGCGAGCGC GGCATCACGA TCGACATTGC 50 

GCTGTGGAAG TTCGAGTCGC CCAAGTCCGT GTTCACGATC ATCGATGCGC 100

CCGGCCACCG CGACTTCATC AAGAACATGA TCACGGGCAC GTCGCAGGCG 150 

GACGCCGCCA TCCTGATGAT CGACTCGACG CATGGTGGCT TCGAGGCTGG 200 

CATCTCGAAG GACGGCCAGA CCCGCGAGCA CGCGCTGCTT GCCTTCACTC 250 

TTGGCGTGAA GCAGATGGTG GTGTGCTGCA ACAAGATGGA CGACAAGACG 300 

GTGACGTACG CGCAGTCGCG CTACGATGAG ATCAGCAAGG AGGTGGGCGC 350

GTACCTGAAG CGCGTGGGCT ACAACCCGGA GAAGGTGCGC TTCATCCCGA 400 

TCTCGGGCTG GCAGGGCGAC AACATGATCG AGAAGTCGGA CAACATGCCG 450 

TGGTACAAGG GTCCCACGCT GCTGGACGCG CTCGACATGC TGGAGCCGCC 500 

GGTGCGCCCG GTGGACAAGC CGCTGCGCCT GCCCCTGCAG GACGTGTACA 550 

AGATCGGCGG TATCGGGACG GTGCCCGTGG GCCGCGTGGA GACCGGCATC 600

ATGAAGCCGG GCGACGTGGT GACGTTCGCG CCCGCCAACG TGACGACTGA 650 

GGTGAAGTCG ATCGAGATGC ACCACGAGCA GCTGGCGGAG GCGCAGCCCG 700 

GCGACAACGT CGGCTTCAAC GTGAAGAACG TGTCGGTGAA GGACATCCGC 750 

CGTGGTAACG TGTGCGGCAA CTCGAAGAAC GACCCGCCGA AGGAGGCGGC 800

CGACTTCACG GCGCAGGTGA TCGTGCTGAA CCACCCCGGC CAGATCAGCA 850

ACGGCTATGC GCCGGTGCTG GACTGCCACA CGAGCCACAT TGCGTGCCGC 900 

TTCGCGGAAA TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTGGA 950 

GAAGAACCCC AAGGCGATCA AGTCTGGCGA TGCCGCGATC GTGAAGATGG 1000 

TGCCGCAGAA GCCGATGTGC GTGGAGGTGT TCAACGACTA CGCGCCGCTG 1050 

GGCCGCTTTG CCGTGCGCGA C 1071



2) INFORMATION FOR SEQ ID NO: 521 






(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1114 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Leishmania hertigi

(B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521 

TCGTTCAAGT ACGCGTGGGT GCTCGACAAG CTGAAGGCGG AGCGCGAGCG 50

CGGTATCACG ATCGACATTG CGCTGTGGAA GTTCGAGTCG CCCAAGTCGG 100 

TGTTCACGAT CATCGATGCG CCCGGCCACC GCGACTTCAT CAAGAACATG 150 

ATCACCGGCA CGTCGCAGGC GGATGCTGCC ATTCTGATGA TCGATTCGAC 200 

GCAGGGTGGC TTCGAGGCTG GCATCTCGAA GGACGGCCAG ACGCGCGAGC 250 

ACGCGCTGCT GGCCTTCACG CTGGGCGTGA AGCAGATGGT TGTGTGCTGC 300

AACAAGATGG ACGACAAGAC GGTGCAGTAC GCGCAGGCGC GCTACGAGGA 350

GATCAGCAAG GAGGTGGGCG CGTACCTGAA GCGCGTGGGC TACAACCCGG 400

AGAAGGTGCG CTTCATCCCG ATCTCGGGCT GGCAGGGCGA CAACATGATC 450

GAGAAGTCCG ACAACATGTC GTGGTACAAG GGTCCCACGC TGCTGGAGGC 500

GCTGGACATG CTGGAGGCGC CGGTGCGCCC GGTGGACAAG CCGCTGCGCC 550

TGCCCCTGCA GGACGTGTAC AAGATCGGCG GCATTGGCAC GGTGCCGGTG 600

GGCCGTGTGG AGACCGGCAT CATGAAGCCC GGCGACGTGG TGACGTTCGC 650

GCCCGCCAAC GTGACGACGG AGGTGAAGTC GATCGAGATG CACCACGAGC 700

AGCTGCAGGA GGCTGTGCCC GGCGACAACG TCGGCTTCAA CGTGAAGAAC 750

GTGTCGGTGA AGGACATCCG CCGTGGTAAC GTGTGTGGCA ACTCGAAGAA 800

CGACCCGCCG AAGGAGGCGG CTGACTTCAC GGCGCAGGTG ATCGTGCTGA 850

ACCACCCCGG CCAGATCAGC AACGGCTACG CGCCGGTGCT GGACTGCCAC 900

ACCAGCCACA TCGCGTGCCG CTTCGCGGAC ATCGAGTCGA AGATCGACCG 950

CCGCTCCGGC AAGGAGCTGG AGAAGAACCC CAAGTCCATC AAGTCCGGCG 1000

ACGCCGCCAT CGTGAAGATG GTGCCGCAGA AGCCGATGTG CGTGGAGGTG 1050

TTCAACGACT ACCCGCCGCT GGGCCGCTTT GCGGTGCGCG ACATGCGCCA 1100

AACCGTTGCC GTCG 1114



2) INFORMATION FOR SEQ ID NO: 522 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1106 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATTCTGATG ATCGACTCGA CGCATGGCGG 200

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300

GACGACAAGA CGGTGACGTA CGCGCAGTCR CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 450

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAT GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100

CGTCGG 1106 




2) INFORMATION FOR SEQ ID NO: 523 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 
(B) STRAIN: ATCC 50131

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCGAAGTCC GTGTTCACGA 100

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCGGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CGGTGATGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350

GGAGGTGAGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

GCTTCATCCC GATCTCGGGG TGGCAGGGCG ACAACATGAT CGACAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600

GAGACCGGGA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATCGCGTGCC GCTTCGCGGA GATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GACGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100

CGTCG 1105 



2) INFORMATION FOR SEQ ID NO: 524

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Leishmania mexicana 

(B) STRAIN: ATCC 50156 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCGAAGTCC GTGTTCACGA 100

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150

ACGTCGCAGG CGGACGCGGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300

GACGACAAGA CGGTGATGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350

GGAGGTGAGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400

GCTTCATCCC GATCTCGGGG TGGCAGGGCG ACAACATGAT CGACAAGTCG 450

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600

GAGACCGGGA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900

ATCGCGTGCC GCTTCGCGGA GATCGAGTCC AAGATCGACC GCCGCTCCGG 950

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GACGCCGCGA 1000

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTT 1098






2) INFORMATION FOR SEQ ID NO: 525



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1081 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 

(B) STRAIN: II WT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525

GCTGAAGGCG GAGCGCGAGC GCGGCATCAC GATCGACATT GCGCTGTGGA 50 

AGTTCGAGTC GCCCAAGTCG GTGTTCACGA TCATCGATGC GCCCGGCCAC 100 

CGCGACTTCA TCAAGAACAT GATCACGGGC ACGTCGCAGG CGGACGCGGC 150 

CATCCTGATG ATCGACTCGA CGCACGGTGG GTTCGAGGCT GGCATCTCGA 200

AGGACGGGCA GACGCGCGAG CACGCGCTGC TTGCCTTCAC TCTTGGCGTG 250 

AAGCAGATGG TTGTGTGCTG CAACAAGATG GACGACAAGA CGGTGATGTA 300 

CGCGCAGTCG CGCTACGATG AGATCAGCAA GGAGGTGGGC GCGTACCTGA 350 

AGCGCGTGGG CTACAACCCG GAGAAGGTGC GCTTCATCCC GATCTCGGGC 400

TGGCAGGGCG ACAACATGAT CGAGAAGTCG GACAACATGC CGTGGTACAA 450

GGGTCCCACG CTGCTGGACG CGCTCGACAT GCTGGAGGCG CCGGTGCGCC 500 

CGGTGGACAA GCCGCTGCGC CTGCCCCTGC AGGACGTGTA CAAGATCGGC 550 

GGCATCGGCA CGGTGCCCGT GGGCCGCGTG GAGACCGGCA TCATGAAGCC 600 

GGGCGACGTG GTGACGTTCG CGCCCGCGAA CGTGACGACG GAGGTGAAGT 650 

CGATCGAGAT GCACCACGAG CAGCTGGCGG AGGCGCAGCC CGGCGACAAC 700

GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG AAGGACATCC GCCGTGGGAA 750 

CGTGTGCGGT AACTCGAAGA ACGACCCGCC GAAGGAGGCC GCCGACTTCA 800 

CGGCGCAGGT GATCGTGCTG AACCACCCCG GCCAGATCAG CAACGGCTAC 850 

GCGCCGGTGC TGGACTGCCA CACGAGCCAC ATCGCGTGCC GGTTCGCGGA 900 

CATCGAGTCC AAGATTGACC GCCGCTCCGG CAAGGAGCTG GAGAAGAACC 950

CCAAGGCGAT CAAGTCCGGC GATGCCGCGA TCGTGAAGAT GGTGCCGCAG 1000

AAGCCGATGT GCGTGGAGAT GTTCAACGAC TACGCGCCGC TTGGCCGCTT 1050

TGCTGTGCGC GACATGCGCC AAACCGTTGC C 1081



2) INFORMATION FOR SEQ ID NO: 526

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1102 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Leishmania tropica

(B) STRAIN: ATCC 50129

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526



AAATACGCGT GGGTGCTCGA CAAGCTGAAG GCGGAGCGCG AGCGCGGCAT 50 

CACGATCGAC ATTGCGCTGT GGAAGTTCGA GTCGCCCAAG TCCGTGTTCA 100 

CGATCATCGA TGCGCCCGGC CACCGCGACT TCATCAAGAA CATGATCACG 150

GGCACGTCGC AGGCGGACGC CGCCATCCTG ATGATCGACT CGACGCATGG 200

TGGCTTCGAG GCTGGCATCT CGAAGGACGG CCAGACCCGC GAGCACGCGC 250 

TGCTTGCCTT CACKCTTGGC GTGAAGCAGA TGGTGGTGTG CTGCAACAAG 300 

ATGGACGACA AGACGGTGAC GTACGCGCAG TCGCGCTACG ATGAGATCAG 350 

CAAGGAGGTG GGCGCGTACC TGAAGCGCGT GGGCTACAAC CCGGAGAAGG 400

TGCGCTTCAT CCCGATCTCG GGCTGGCAGG GCGACAACAT GATCGAGAAG 450 

TCGGACAACA TGCCGTGGTA CAAGGGTCCC ACGCTGCTGG ACGCGCTCGA 500 

CATGCTGGAG CCGCCGGTGC GCCCGGTGGA CAAGCCGCTG CGCCTGCCCC 550 

TGCAGGACGT GTACAAGATC GGCGGTATCG GGACGGTGCC CGTGGGGCGC 600 

GTGGAGACCG GCATCATGAA GCCGGGCGAC GTGGTGACGT TCGCGCCCGC 650

CAACGTGACG ACTGAGGTGA AGTCGATCGA GATGCACCAC GAGCAGCTGG 700 

CGGAGGCGCA GCCCGGCGAC AACGTCGGCT TCAACGTGAA GAACGTGTCG 750 

GTGAAGGACA TCCGCCGTGG TAACGTGTGC GGCAACTCGA AGAACGACCC 800 

GCCGAAGGAG GCGGCCGACT TCACGGCGCA GGTGATCGTG CTGAACCACC 850 

CCGGCCAGAT CAGCAACGGC TACGCGCCGG TGCTGGACTG CCACACGAGC 900

CACATTGCGT GCCGCTTCGC GGAAATCGAG TCCAAGATCG ACCGCCGCTC 950 

CGGCAAGGAG CTGGAGAAGA ACCCCAAGGC GATCAAGTCT GGCGATGCCG 1000 

CGATCGTGAA GATGGTGCCG CAGAAGCCGA TGTGCGTGGA GGTGTTCAAC 1050 

GACTACGCGC CGCTGGGCCG CTTTGCCGTG CGCGACATGC GCCAAACCGT 1100 

TG 1102



2) INFORMATION FOR SEQ ID NO: 527 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Neospora caninum


(B) STRAIN: Suarez-4 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527 

GGACRAACTT AAAGCTGAAC GTGAGCGTGG TATCACCATT GATATCTCCC 50

TGTGGAAATT TGAGACCAGC AAGTACTATG TTACCATCAT TGATGCCCCA 100 

GGACACAGAG ACTTCATCAA AAACATGATT ACAGGCACAT CCCAGGCTGA 150 

CTGTGCTGTC CTGATTGTTG CTGCTGGTGT TGGTGAATTT GAAGCCGGTA 200 

TCTCCAAGAA CGGGCAGACC CGTGAGCATG CCCTTNTGGC TTACACCCTG 250 

GGTGTGAAAC AACTAATTGT TGGCGTTAAC AAAAKGGATT CCACTGAGCC 300

ACCCTATAGC CARAAGAGAT ACGARGAAAT TGTTAAGGAA GTCAGCMCCT 350 

AYNTTAAAAA AATTGGYTAC AACCCCGACA CAGTANCATT TGKGCCAATT 400

TNTGGCTGGA ATGGTGACAA CATGCTGGAN CCAAGTGCTA ATATGCCATG 450 

GTTCAAGGGA TGGAAAGTCM CCCGTAAGGA CGGCAATGCC AGKGGAACCM 500 

CCCTGCTTGA AGCTYTGGAT TGCATTYTGC CACCAAYTTG CCCAACTGAC 550

AAACCCTTGC GTTTGCCTYT CCAGGATGTC TATAAAATTG GKGGTATTGG 600 

TACTGTCCCT GTGGGTCGTG TGGAGACTGG TGTTCTCAAA CCTGGCATGG 650 

TGGTCACCTT TGCTCCAGTC AATGTAACAA CTGAAGTGAA GTCTGTAGAA 700

ATGCACCATG AAGCATTGAG TGAAGCCCTT CCTGGGGACA ATGTGGGCTT 750 

CAATGTCAAG AACGTGTCTG TCAAAGATGT CCGTCGTGGC AATGTGGCTG 800

GTGACAGCAA AAATGATCCA CCCATGGAAG CTGCTGGCTT CACAGCTCAG 850 

GTGATTATTT TGAACCATCC AGGCCAAATC AGTGCTGGAT ATGCACCTGT 900 

GCTGGATTGT CACACAGCTC ACATTGCTTG CAAGTTTGCT GAGCTGAAGG 950 

AGAAGATTGA TCGTCGTTCT GGGAAAAAGC TGGAAGATGG CCCTAAATTC 1000 

TTGAAATCTG GTGACGCTGC CATCGTTGAT ATGGTTCCTG GCAAGCCCAT 1050

GTGTGTCGAG AGCTTCTCTG ATTATCCTCC CCTGGGCCGT TTTGCTGTGC 1100 

GTGAC 1105 






2) INFORMATION FOR SEQ ID NO: 528 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 935 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichomonas vaginalis 

(B) STRAIN: ATCC 30001 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528 



GCACATCCCA GGCTGATGCT GCTATCCTTG TCATCGACTC CACACTCGGT 50

GGYTTCGAAG CCGGTATCGC TGAACAAGGC CAGACACGTG AACACGCTCT 100

TCTTGCCTTC ACACTCGGCA TCAAGCAGGT CATTGTCGCC GTCAACAAGA 150

TGGATGACAA GACAGTCAAC TACAACAAGG CYCGTTTCGA CGAAATCACA 200

GCCGAAATGA CACGCATCCT TACAGGCATC GGCTACAAGC CAGAAATGTT 250

CCGCTTCGTC CCAATCTCCG GCTGGGCTGG CGACAACATG ACAGAGAAGT 300

CTCCAAACAT GCCATGGTAC AATGGCCCAT ACCTTCTTGA AGCCCTCGAT 350

TCCCTTCAGC CACCAAAGCG CCCATTCGAC AAGCCACTCC GTCTTCCACT 400

CCAGGATGTC TACAAGATCA ACGGTATCGG TACAGTTCCA GTCGGCCGTG 450

TCGAATCCGG CACAATGAAG CCAGGCATGA TCGTTAACTT CGCCCCATCC 500

ACAGTTACAG CTGAAGTTAA GTCCATCGAA ATGCACCACG AATCCCTTCC 550

AGAGGCTCTT CCAGGTGACA ACATCGGCTT CAACGTCAAG AACGTTTCCA 600

CAGCTGATGT CAAGCGTGGC TACGTCGTTG GTGATACAAA GCGTGACCCA 650

CCAGTCGAAT GCGCTTCCTT CACAGCTCAR ATGATCATCT CCAACCACCC 700

AGGCAAGATC CACGCCGGCT ACCAGCCAGT TTTCGACTGC CACACftGCTC 750

ACATCGCCTG CAAGTTCGAC AAGCTCATCC AGCGTATCGA TCGTCGCCAC 800

GGCAAGAAGG CTACAGAGAA CCCAGAATAC ATTCAGAAGG ATGATGCCGC 850

TATCGTCGAG GTTGTCCCAT CCAAGCCACT CGTCGTCGAG TCCTTCCAGG 900

AGTACCCACC ACTCGGCCGT TTCGCCATCC GTGAT 935



2) INFORMATION FOR SEQ ID NO: 529 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1065 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Trypanosoma brucei subsp. brucei

(B) STRAIN: EATRO 795

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529 

AAGCTGAAGG CTGAGCGCGA ACGTGGTATC ACGATCGACA TTGCACTGTG 50

GAAATTCGAG TCACCCAAGT CTGTCTTCAC TATTATTGAT GCTCCTGGGC 100 

ACCGTGACTT CATCAAGAAC ATGATCACCG GCACATCGCA AGCCGACGCA 150 

GCCATCCTCA TCATTGCCTC TGCGCAGGGT GAGTTCGAGG CTGGTATCTC 200 

CAAGGATGGA CAGACCCGCG AGCACGCGTT GCTGGCCTTC ACTTTGGGTG 250 

TGAAGCAGAT GGTTGTGTGC TGCAACAAGA TGGACGACAA GACTGTGAAC 300

TACGGACAGG AGCGGTATGA CGAGATTGTG AAGGAGGTGT CTGCTTACAT 350 

CAAGAAGGTT GGGTACAACG TGGAGAAGGT GCGCTTCGTC CCCATCTCCG 400 

GATGGCAGGG CGACAACATG ATTGAGAAAT CCGAGAAGAT GCCATGGTAC 450 

AAGGGTCCAA CGCTCCTGGA GGCACTAGAC ATGCTGGAGC CACCAGTGCG 500 

TCCGAGCGAC AAGCCCCTGC GTCTGCCACT GCAGGACGTG TACAAGATCG 550

GTGGTATTGG CACCGTGCCC GTTGGTCGTG TGGAGACCGG CGTGATGAAG 600 

CCTGGTGATG TGGTGACGTT TGCCCCCGCC AACGTGACGA CCGAGGTGAA 650 

ATCGATCGAG ATGCACCACG AGCAGCTCGC TGAGGCGACC CCCGGTGACA 700 

ACGTCGGCTT TAACGTGAAG AACGTTTCTG TAAAGGACAT CCGCCGTGGC 750 

AACGTCTGCG GTAACACCAA GAACGACCCC CCAAAGGAGG CCGCCGACTT 800

CACGGCACAG GTGATCATCC TGAACCACCC CGGACAGATT GGAAACGGTT 850 

ATGCGCCCGT GCTGGACTGC CACACATCGC ACATTGCCTG CAAGTTCGCG 900 

GAGATCGAGT CGAAGATCGA CCGTCGCTCT GGCAAGGAGC TGGAGAAGGC 950 

TCCCAAGTCG ATCAAGTCTG GCGACGCCGC GATCGTGCGC ATGGTGCCGC 1000 

AGAAGCCTAT GTGCGTGGAG GTCTTCAACG ACTACGCGCC ACTCGGCCGC 1050

TTTGCCGTGC GTGAC 1065 



2) INFORMATION FOR SEQ ID NO: 530

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:


(A) ORGANISM: Crithidia fasciculata

(B) STRAIN: ATCC 11745 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530 


TTCGCGGAGG GCGTGCCGCC GGTGCTGACG GCGCTGGACG TGACGGAGGA 50 

CCTCGGCCGC GACGAGCCGC TGACGCTGGA GATTGTGCAG CACTTGGACG 100 

CGAACACCGG CCGCTGCATT GCCATGCAGA CGACGGATCT GCTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC GGGCGGCAAC ATCTCCGTGC CGGTTGGCCG 200 

CGAGACGCTG GGCCGCATCT TCAACGTGCT CGGCGACGCG ATCGACCAGC 250

GCGGTGTGGT GGGCGAGAAG ATGCGCATGC CGATCCACGC CGAGGCGCCG 300 

AAGCTGGCGG ACCAGGCCGC GGAGGACGCG ATTCTGACGA CCGGCATCAA 350 

GGTGATCGAC CTGATTCTGC CGTACTGCAA GGGTGGCAAG ATCGGGCTGT 400 

TCGGCGGTGC TGGTGTGGGC AAGACTGTGA TCATCATGGA GCTGATCAAC 450 

AACGTGGCCA AGGGCCACGG TGGTTTCTCC GTGTTCGCCG GCGTTGGCGA 500

GCGCACCCGC GAGGGCACGG ATCTGTACCT GGAGATGATG CAGTCGAAGG 550 

TCATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTGGTGTA CGGCCAGATG 600 

AACGAGCCCC CGGGTGCGCG TGCGCGTGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGTGACG TGGAGGGCCA GAACGTGCTG CTGTTCATCG 700 

ACAACATCTT CCGCTTCACC CAGGCCAACT CCGAGGTGTC CGCCCTGCTG 750

GGCCGCATTC CCGCCGCCGT GGGCTACCAG CCGACGCTTG CCGAGGATCT 800 

TGGTATGCTG CAGGAGCGCA TTACGTCGAC GACGAAGGGC TCGATTACGT 850 

CTGTGCAGGC CGTGTACGTG CCGGCCGATG ATATCACGGA TCCGGCGCCG 900 

GCGACGACCT TCTCGCACCT GGATGCGACG ACGGTGCTGG ACCGCGCGGT 950 

TGCCGAGTCT GGCATCTACC CCGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000

GTATCATGGA CCCCGATGTG ATCGACGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

CGTGCTGGGT ATCGACGAGC TGAGCGAGGA GGACAAGCTT GTGGTGGACC 1150 

GCGCTCGCAA GGTGACGCGC TTCCTGTCGC AGCCGTTCCA GGTGGCCGAG 1200

GTGTTCACCG GCATGACGGG CCACTACGTG CAGCTGGAGG ACACAGTGGA 1250

GTCGTTCTCT GGCCTGCTGA TGGGCTCGTA CGACCAGATC CCGGAGA 1297



2) INFORMATION FOR SEQ ID NO: 531

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30816 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531 






CTTCTCGGAG GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC 100 

GCGAACACGG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTCGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

GTGAGACGCT GGGCCGCATC TTCAAYGTTC TGGGCGACGC GATCGACCAG 250

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300

GAAGCTGGCG GATCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTGATTCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG 400

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

CAACGTCGCG AAGGGCCACG GCGGTTTCTC CGTGTTTGCC GGCGTTGGCG 500


AGCGCACGCG CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG 550

GTGATTGACC TGAAGGGCGA GTCGAAGTGY GTGCTTGTGT ATGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 950 

TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000

CGTATCATGG ACCCTGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTTGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG 1250

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 1298 



2) INFORMATION FOR SEQ ID NO: 532

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532 



TTCTCGGAGG GCGTGCCGCC CGTGCTGACG GCGCTGGATG TGACGGAGGA 50 

CCTTGGCCGC GATGAGCCGC TGACGCTGGA GATCGTGCAG CACTTGGACG 100 

CGAACACCGG CCGCTGCATT GCGATGCAGA CGACGGACCT GCTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC CGGCGGCAAC ATCTCTGTGC CGGTGGGCCG 200 

TGAGACGCTG GGCCGCATCT TCAACGTTCT GGGCGACGCG ATCGACCAGC 250

GCGGCCCCGT GGGCGAGAAG ATGCGCATGG CGATCCACGC CGAGGCCCCA 300 

AAGCTGGCGG ATCAGGCCGC GGAGGACACG ATCCTGACGA CCGGCATCAA 350 

GGTGATCGAC CTGATTCTGC CCTACTGCAA GGGTGGCAAG ATCGGCCTGT 400 

TCGGCGGTGC CGGTGTGGGC AAGACTGTGA TCATCATGGA GCTGATCAAC 450

AACGTCGCGA AGGGCCACGG TGGTTTCTCC GTGTTTGCCG GCGTTGGCGA 500

GCGCACGCGC GAGGGCACGG ACCTGTACCT GGAGATGATG CAGTCGAAGG 550

TGATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTTGTGTA CGGGCAGATG 600 

AACGAGCCCC CGGGTGCGCG CGCGCGCGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGCGACG TGGAGGGCCA GAACGTGCTG CTGTTCATCG 700 

ACAACATCTT CCGCTTCACG CAGGCGAACT CCGAGGTGTC TGCGCTGCTG 750

GGCCGCATTC CAGCCGCCGT GGGCTACCAG CCGACGCTTG CGGAGGATCT 800 

TGGTATGCTG CAGGAGCGCA TCACGTCGAC AACGAAGGGG TCGATCACGT 850 

CCGTGCAGGC CGTGTACGTG CCAGCGGATG ATATCACGGA TCCCGCGCCC 900 

GCGACGACGT TCTCGCACCT GGACGCGACG ACTGTGCTGG ACCGCGCGGT 950 

GGCGGAGTCG GGCATCTACC CTGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000

GTATCATGGA CCCCGACGTG ATCGATGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

GGTGCTTGGC ATCGACGAGC TGAGCGAGGA AGACAAGGTT GTTGTGGACC 1150 

GCGCGCGCAA GGTGACCCGG TTCCTGTCGC AGCCGTTCCA GGTTGCGGAG 1200 

GTGTTCACGG GCATGACGGG CCACTACGTG CAGCTGGTCG ACACGGTGGA 1250

GTCGTTCTCT GGCCTGCTGA TGGGGTCGTA CGACCAGATC CCGGAGA 1297



2) INFORMATION FOR SEQ ID NO: 533

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533 






CTTCTCGGAG GGCGTGCCGC CCGTACTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAT 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200

GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGACGC GATCGACCAG 250

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300 

GAAGCTGGCG GACCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350

AGGTGATCGA CCTCATTCTG CCCTACTGCA AGGGCGGCAA GATCGGCCTG 400

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTTGCC GGCGTTGGCG 500

AGCGCACGCG CGAGGGCACG GACCTATACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCCGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACATCGA CGACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCGGCGGAT GATATCACGG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 950 

TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGCC GACACGGTGG 1250

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 1298 



2) INFORMATION FOR SEQ ID NO: 534

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:


(A) ORGANISM: Leishmania infantum

(B) STRAIN: MOU 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534 

CTTCTCGGAG GGCGTGCCGC CCGTACTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAT 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGACGC GATCGACCAG 250

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300

AAAGCTGGCG GACCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTCATTCTG CCCTACTGCA AGGGCGGCAA GATCGGCCTG 400 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450

CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACCTATACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCCGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACATCGA CGACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCGGCGGAT GATATCACTG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 950 

TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGCC GACACGGTGG 1250 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 1298 



2) INFORMATION FOR SEQ ID NO: 535 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1301 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 

(B) STRAIN: ATCC 50121 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535 

GCACTTCTCG GAGGGCGTGC CGCCCGTGCT GACGGCGCTG GATGTGACGG 50 

AGGACCTTGG CCGCGATGAG CCGCTGACGC TGGAGATCGT GCAGCACTTG 100 

GACGCGAACA CCGGCCGCTG CATTGCGATG CAGACGACGG ACCTGCTGAA 150 

GCTGAAGTCG AAGGTTGTGT CGACCGGTGG CAACATCTCT GTGCCGGTGG 200 

GCCGTGAGAC GCTGGGCCGC ATCTTCAACG TTCTGGGCGA TGCGATCGAC 250 

CAGCGCGGCC CCGTGGGCGA GAAGATGCGC ATGGCGATCC ACGCCGAGGC 300

CCCGAAGCTG GCGGATCAGG CCGCGGAGGA CACGATCCTG ACGACCGGCA 350 

TCAAGGTGAT CGACCTGATT CTGCCCTACT GCAAGGGTGG CAAGATCGGY 400

CTGTTCGGCG GTGCCGGTGT GGGCAAGACT GTGATCATCA TGGAGCTGAT 450

CAACAACGTC GCGAAGGGCC ACGGTGGTTT CTCCGTGTTT GCCGGCGTTG 500 

GCGAGCGCAC GCGCGAGGGC ACGGACCTGT ACCTGGAGAT GATGCAGTCG 550

AAGGTGATTG ACCTGAAGGG CGAGTCGAAG TGCGTGCTTG TGTACGGGCA 600

GATGAACGAG CCCCCGGGTG CGCGCGCGCG CGTTGCGCAG TCTGCGCTGA 650

CGATGGCGGA GTACTTCCGC GACGTGGAGG GCCAGAACGT GCTGCTGTTC 700

ATCGACAACA TCTTCCGCTT CACGCAGGCG AACTCCGAGG TGTCCGCGCT 750

GCTGGGCCGC ATTCCGGCCG CCGTGGGCTA CCAGCCGACG CTTGCGGAGG 800

ATCTTGGTAT GCTGCAGGAG CGCATCACGT CGACAACGAA GGGGTCGATC 850

ACGTCCGTGC AGGCCGTGTA CGTGCCAGCG GATGATATCA CGGATCCCGC 900

GCCCGCGACG ACGTTCTCGC ACCTTGACGC GACGACTGTG CTGGACCGCG 950

CGGTGGCGGA GTCGGGCATC TACCCTGCCG TGAACCCGCT GGAGTGCGCG 1000

TCGCGTATCA TGGACCCCGA TGTGATCGAT GTGGACCACT ACAACGTTGC 1050

GCAGGATATC GTGCAGATGC TGACCAAGTA CAAGGAGCTG CAGGACATCA 1100

TTGCGGTGCT TGGCATCGAC GAGCTGAGCG AGGAAGACAA GGTTGTGGTG 1150

GACCGCGCGC GCAAGGTGAC CCGGTTCCTG TCGCAGCCGT TCCAGGTTGC 1200

GGAGGTGTTC ACGGGCATGA CGGGCCACTA CGTGCAGCTG GTCGACACGG 1250

TGGAGTCGTT CTCTGGCTTG CTGATGGGGT CGTACGACCA GATCCCGGAG 1300

A 1301



2) INFORMATION FOR SEQ ID NO: 536 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania hertigi 

(B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536 

CTTCGCGGAG GGCGTGCCGC CGGTGCTGAC GTCGCTGGAT GTGACGGAGA 50 

ACCTCGGCCG CGATGAGCCG CTGACGCTGG AGATTGTGCA GCACTTGGAC 100 

GCGAACACCG GTCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTCGTGTCGA CCGGTGGCAA CATCTCTGTG CCTGTTGGCC 200 

GCGAGACGCT GGGTCGCATC TTCAACGTGC TTGGCGATGC GATTGACCAG 250 

CGCGGCCCTG TGGGTGAGAA GATGCGCATG GCGATCCACG CCGAGGCGCC 300 

GAAGCTGGCG GATCAGGCGG CAGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA TCTTATTCTG CCGTACTGCA AGGGTGGTAA GATCGGTCTG 400 

TTCGGTGGTG CCGGTGTAGG CAAGACTGTG ATTATTATGG AGCTGATCAA 450 

TAACGTGGCG AAGGGCCACG GTGGGTTTTC CGTGTTTGCT GGCGTGGGCG 500 

AGCGCACGCG CGAGGGCACT GACCTGTACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA ATCAAAGTGC GTGCTTGTGT ACGGACAGAT 600 

GAACGAGCCC CCGGGTGCGC GTGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCCGAGTA CTTCCGCGAT GTGGAGGGCC AGAACGTGCT GCTGTTCATT 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGTCGCATT CCTGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 

TGGGCATGCT GCAGGAGCGC ATTACGTCGA CGACGAAGGG CTCGATTACG 850 

TCTGTGCAGG CCGTGTACGT GCCTGCGGAT GATATCACGG ACCCGGCGCC 900 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGTGCGG 950 

TGGCAGAGTC GGGCATTTAC CCTGCGGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CCGTGCTTGG TATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 




GGTGTTCACT GGCATGACGG GTCACTACGT TCAGCTGGAG GACACtGGTGG 1250



2) INFORMATION FOR SEQ ID NO: 537 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537 

CTTCTCGGAG GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG TGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200

GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGATGC GATCGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300 

GAAGCTGGCG GATCAGGCCG CAGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CTTGATCCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG 400

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450

CAATGTCGCG AAGGGCCACG GTGGTTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CCGCGCTGCT 750 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGATGCGAC GACTGTGCTG GACCGCGCGG 950 

TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GACATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG 1250 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAG 1297



2) INFORMATION FOR SEQ ID NO: 538 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 97 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Leishmania amazonensis 

(B) STRAIN: ATCC 50131 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538 

TTCTCGGAGG GCGTGCCGCC CGTGCTGACG GCGCTGGATG TGACGGAGGA 50 

CCTTGGCCGC GATGAGCCGC TGACGCTGGA GATCGTGCAG CACCTGGACG 100 

CGAACACCGG CCGCTGCATT GCGATGCAGA CGACGGACCT GTTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC CGGCGGCAAC ATCTCTGTGC CGGTGGGCCG 200 

TGAGACGCTG GGCCGCATCT TCAACGTGCT GGGCGACGCG ATCGACCAGC 250 

GCGGCCCCGT GGGTGAGAAG ATGCGCATGG CGATCCACGC CGAGGCCCCG 300 

AAGCTGGCGG ATCAGGCCGC GGAGGACACG ATCCTGACGA CCGGCATCAA 350 

GGTGATCGAC CTGATTCTGC CCTACTGCAA GGGTGGCAAG ATCGGCCTGT 400 

TTGGTGGCGC CGGTGTGGGC AAGACCGTGA TCATCATGGA GTTGATTAAC 450

AACGTCGCGA AGGGCCACGG TGGTTTCTCG GTGTTTGCCG GCGTTGGCGA 500

GCGCACGCGC GAGGGCACGG ACCTGTACCT GGAGATGATG CAGTCGAAGG 550 

TGATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTTGTGTA CGGGCAGATG 600 

AACGAGCCCC CGGGTGCGCG CGCGCGCGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGAGACG TGGAGGGCCA GAATGTGCTG CTGTTCATCG 700 

ACAACATCTT CCGCTTCACG CAGGCGAACT CCGAGGTGTC TGCGCTGCTG 750 

GGCCGCATTC CGGCCGCCGT GGGCTACCAG CCGACGCTTG CGGAGGATCT 800 

TGGTATGCTG CAGGAGCGCA TCACGTCGAC GACGAAGGGG TCGATCACGT 850 

CCGTGCAGGC CGTGTACGTG CCTGCGGATG ATATCACGGA TCCGGCGCCC 900 

GCGACGACGT TCTCGCACCT GGACGCGACG ACTGTGCTGG ACCGCGCGGT 950 

GGCGGAGTCG GGGATCTACC CTGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000 

GTATCATGGA CCCCGATGTG ATCGACGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

GGTGCTTGGT ATCGACGAGC TGAGCGAGGA GGACAAGGTC GTGGTGGACC 1150 

GCGCGCGCAA GGTGACCCGG TTCCTGTCGC AGCCGTTCCA GGTTGCGGAG 1200

GTGTTCACGG GCATGACGGG CCACTACGTG CAGCTGGCCG ACACGGTGGA 1250

GTCGTTCTCT GGGCTGCTGA TGGGGTCGTA CGACCAGATC CCGGAGA 1297



2) INFORMATION FOR SEQ ID NO: 539

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539 
CCITACATCC TBGTYGCICT IAACAAG 



2) INFORMATION FOR SEQ ID NO: 540 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540
GGDGCITCYT CRTCGWAITC CTG



2) INFORMATION FOR SEQ ID NO: 541

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541 
GTKGAAATGT TCCGCAAGCT GCT 



2) INFORMATION FOR SEQ ID NO: 542 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542 
CGGAARTAGA ACTGSGGACG GTAG 



2) INFORMATION FOR SEQ ID NO: 543 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543 
ATCTTAGTAG TTTCTGCTGC TGA 



2) INFORMATION FOR SEQ ID NO: 544 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544 
AYGTTGTCGC CMGGCATTMC CAT 



2) INFORMATION FOR SEQ ID NO: 545 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545 
TACATCCTBG TYGCICTIAA CAAGTG 



2) INFORMATION FOR SEQ ID NO: 546 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546 
CCRCGICCGG TRATGGTGAA GAT



2) INFORMATION FOR SEQ ID NO: 547 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547
GTACAGTTGC TTCAGGACGT ATC 



2) INFORMATION FOR SEQ ID NO: 548 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548

ACGTTCGATT TCATCACGTT G 21 



2) INFORMATION FOR SEQ ID NO: 549 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549
GAACGTGATA CTGACAAACC TTTA 24 

2) INFORMATION FOR SEQ ID NO: 550 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550 
GAAGAAGAAC ACCAACGTTG 20 



2) INFORMATION FOR SEQ ID NO: 551 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551 

GAAGAAAAAA TCTTCGAACT GGCTA 25



2) INFORMATION FOR SEQ ID NO: 552

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552 
TACACGGCCG GTGACTACG 



2) INFORMATION FOR SEQ ID NO: 553 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553 
GGCCGTGTTG AACGTGGTCA AATCA 



2) INFORMATION FOR SEQ ID NO: 554 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 554 



GTTCCTTACA TCGTTGTTTT TCTC 



2) INFORMATION FOR SEQ ID NO: 555 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555 
TCTCGAACTT TCTCTATGTA TGCA 24

2) INFORMATION FOR SEQ ID NO: 556 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 556 

CGGCGCNATC YTSGTTGTTG C 21 



2) INFORMATION FOR SEQ ID NO: 557

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557

CCMAGGCATR ACCATCTCGG TG 22 



2) INFORMATION FOR SEQ ID NO: 558 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558 
TCITTYAART AYGCITGGGT 20 

2) INFORMATION FOR SEQ ID NO: 559 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559 
CCGACRGCRA YIGTYTGICK CAT 



2) INFORMATION FOR SEQ ID NO: 560 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560 
GAYTTCATYA ARAAYATGAT YAC 



2) INFORMATION FOR SEQ ID NO: 561 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561 
ACIGTICGGC CRCCCTCACG GAT



2) INFORMATION FOR SEQ ID NO: 562 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562 
CARATGRAYG ARCCICCIGG IGYIMGIATG 



2) INFORMATION FOR SEQ ID NO: 563

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563 
GGYTGRTAIC CIACIGCIGA IGGCAT 26 



2) INFORMATION FOR SEQ ID NO: 564 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564 
TAYGGICARA TGAAYGARCC ICCIGGIAA 29

2) INFORMATION FOR SEQ ID NO: 565 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565 
GGYTGRTAIC CIACIGCIGA IGGDAT 26



2) INFORMATION FOR SEQ ID NO: 566 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566 



2) INFORMATION FOR SEQ ID NO: 567

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567
TCRTCIGCIG GIACRTAIAY IGCYTG 



2) INFORMATION FOR SEQ ID NO: 568 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568 
RTIATIGGIG CIGTIRTIGA YGT



2) INFORMATION FOR SEQ ID NO: 569 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569
RTIRTIGGIS CIGTIRTIGA TAT 



2) INFORMATION FOR SEQ ID NO: 570 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570 
RTIRYIGGIC CIGTIRTIGA YGT 23



2) INFORMATION FOR SEQ ID NO: 571 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571 

RTIRTIGGIC CIGTIRTIGA TGT 23 



2) INFORMATION FOR SEQ ID NO: 572

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572

RTIRTIGGIS CIGTIRTIGA 20 



2) INFORMATION FOR SEQ ID NO: 573 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573 
CCICCIACCA TRTARAAIGC 20 

2) INFORMATION FOR SEQ ID NO: 574 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574 
ATIGCIATGG AYGGIACIGA RGG 



2) INFORMATION FOR SEQ ID NO: 575 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575 
TIACCATTTC AGTACCTTCT GGTAA 



2) INFORMATION FOR SEQ ID NO: 576 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576 
AACTTCRTCA AGAAGGTYGG TTACAA 



2) INFORMATION FOR SEQ ID NO: 577 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577 
CATGATTGAA CCATCCACCA 



2) INFORMATION FOR SEQ ID NO: 578

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578 
CATGATTGAA GCTTCCACCA 20 



2) INFORMATION FOR SEQ ID NO: 579 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579
GAAGGCCGTG CTGGTGAGAA 20 

2) INFORMATION FOR SEQ ID NO: 580 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580 
GCTAAACCAG CTACAATCAC TCCAC 25



2) INFORMATION FOR SEQ ID NO: 581 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581 



2) INFORMATION FOR SEQ ID NO: 582

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582 
TTTCAACTTC GTCGTTGACA CGAACAGT 



2) INFORMATION FOR SEQ ID NO: 583 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583 
CAACTGCTTT TTGGATATCT TCTTTAATAC CAACG 



2) INFORMATION FOR SEQ ID NO: 584 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584 
ACATGACACA TCTAAAACAA 



2) INFORMATION FOR SEQ ID NO: 585 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585 
ACCACATACT GAATTCAAAG 20

2) INFORMATION FOR SEQ ID NO: 586 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586 

CAGAAGTATA CGTATTATCA 20 



2) INFORMATION FOR SEQ ID NO: 587

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587

CGTATTATCA AAAGACGAAG 20



2) INFORMATION FOR SEQ ID NO: 588 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588 
TCTTCTCAAA CTATCGTCCA 20 

2) INFORMATION FOR SEQ ID NO: 589 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589 
GCACGAAACT TCTAAAACAA 



2) INFORMATION FOR SEQ ID NO: 590 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590 
TATACGTATT ATCTAAAGAT 



2) INFORMATION FOR SEQ ID NO: 591 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591 
TCCTGGTTCT ATTACACCAC 



2) INFORMATION FOR SEQ ID NO: 592 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592 
CAAAGCTGAA GTATACGTAT 



2) INFORMATION FOR SEQ ID NO: 593

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593 
TTCACTAACT ATCGCCCACA 20 

2) INFORMATION FOR SEQ ID NO: 594 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594 
ATTGGTATCC ATGACACTTC 20

2) INFORMATION FOR SEQ ID NO: 595 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595 

TTAAAGCAGA CGTATACGTT 20



2) INFORMATION FOR SEQ ID NO: 596 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596 

GAAATTATTG GTATCAAAGA



2) INFORMATION FOR SEQ ID NO: 597

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597
ATTGGTATCA AAGAAACTTC 



2) INFORMATION FOR SEQ ID NO: 598 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598 
AATTACACCT CACACAAAAT 



2) INFORMATION FOR SEQ ID NO: 599 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599 
CGGTGAAGAA ATCGAAATCA 



2) INFORMATION FOR SEQ ID NO: 600 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600 
ATGCAAGAAG AATCAAGCAA 20



2) INFORMATION FOR SEQ ID NO: 601 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601 

GTTTCACGTG ATGATGTACA 20 

2) INFORMATION FOR SEQ ID NO: 602

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602

AAGTTGAAGT TGTTGGTATT 20 



2) INFORMATION FOR SEQ ID NO: 603



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603 
GGTATTAAAG ACGAAACATC 20

2) INFORMATION FOR SEQ ID NO: 604 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604 
GGTGATGAAG TAGAAATCGT 



2) INFORMATION FOR SEQ ID NO: 605

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605 
GAAATGTTCC GTAAATTATT 



2) INFORMATION FOR SEQ ID NO: 606

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606 
ATTAGACTAC GCTGAAGCTG 



2) INFORMATION FOR SEQ ID NO: 607 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis

(B) STRAIN: ATCC 29212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607 

CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGTCCTATG CCTCAAACAC 50

GTGAACATAT CTTATTATCA CGTAACGTTG GTGTACCATA CATCGTTGTA 100

TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAT TAGAATTAGT 150

AGAAATGGAA GTTCGTGACT TATTATCAGA ATACGATTTC CCAGGCGATG 200

ATGTTCCAGT TATCGCAGGT TCTGCTTTGA AAGCTTTAGA AGGCGACGAG 250

TCTTATGAAG AAAAAATCTT AGAATTAATG GCTGCAGTTG ACGAATATAT 300

CCCAACTCCA GAACGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG 350

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG ACGTGTTGAA 400

CGTGGTGAAG TTCGCGTTGG TGACGAAGTT GAAATCGTTG GTATTAAAGA 450

CGAAACATCT AAAACAACTG TTACAGGTGT TGAAATGTTC CGTAAATTAT 500

TAGACTACGC TGAAGCAGGC GACAACTTCG GTGCTTTATT ACGTGGTGTA 550

GCACGTGAAG ATATCGAACG TGGACAAGTA TTAGCTAAAC CAGCTACAAT 600

CACTCCACAC ACAAAATTCA AAGCTGAAGT ATACGTATTA TCAAAAGAAG 650

AAGGCGGACG TCACACTCCA TTCTTCACTA ACTACCGTCC TCAATTCTAC 700

TTCCGTACAA CAGACGTTAC TGGTGTTGTA GAATTGCCAG AAGGTACTGA 750

AATGGTAATG CCTGGTGATA ACGTTGCTAT GGACGTTGAA TTAATTCACC 800

CAATCGCTAT CGAAGACGGA A 821



2) INFORMATION FOR SEQ ID NO: 608 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608 



CGGAGCTATC TTGGTAGTTT CTGCTGCTGA CGGCCCAATG CCTCAAACTC 50

GTGAACACAT CCTATTGTCT CGTCAAGTTG GTGTTCCTTA CATCGTTGTA 100

TTCTTGAACA AAGTAGACAT GGTTGATGAC GAAGAATTAC TAGAATTAGT 150

TGAAATGGAA GTTCGTGACC TATTAACAGA ATACRAATTC CCTGGTGRCG 200

ATGTTCCTGT AGTTGCTGGA TCAGCTTTGA AAGCTCTAGA AGGCGACGCT 250

TCATACGAAG AAAAAATTCT TGAATTAATG GCTGCAGTTG ACGAATACAT 300

CCCAACTCCA GAACGTGACA ACGACAAACC ATTCATGATG CCAGTTGAAG 350

ACGTGTTCTC AATTACTGGA CGTGGTACTG TTGCTACAGG TCGTGTTGAA 400

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAGTTGTTG GTATTGCTGA 450

AGAAACTTCA AAAACAACAG TTACTGGTGT TGAAATGTTC CGTAAATTGT 500

TAGACYACGC TGAAGCTGGA GACRACATTG GTGCTTTACT ACGTGGTGTT 550

GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGCTAAAC CAGGTACAAT 600

CACACCTCRT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACAAAAGAAG 650

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC 700

TTCCGTACAA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGAACTGA 750

A 751



2) INFORMATION FOR SEQ ID NO: 609 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 751 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 49573

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609 

CGGTGCGATC TTAGTAGTAT CTGCTGCTGA CGGTCCTATG CCTCAAACTC 50 

GTGAACACAT CTTGTTATCA CGTAACGTTG GCGWACCATA CATCGTTGTT 100 

TTCTTGAACA AAATGGATAT GGTTGAYGAC GAAGAATTGC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACC TATTGTCTGA ATATGACTTC CCAGGCGACG 200

ATGTTCCTGT AATCGCCGGT TCTGCTTTGA AAGCTCTTGA AGGAGATCCT 250 

TCATACGAAG AAAAAATCAT GGAATTGATG GCTGCAGTTG ACGAATACGT 300 

TCCAACTCCA GAACGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGACAAG TTCGCGTTGG TGATGAAGTA GAAATCGTTG GTATTGCTGA 450 

CGAAACTGCT AAAACAACTG TAACAGGTGT TGAAATGTTC CGTAAATTGT 500 

TAGACTATGC TGAAGCAGGG GATAACATTG GTGCATTGCT ACGTGGGGTT 550 

GCTCGTGAAG ACATCCAACG TGGACAAGTA TTGGCTAAAG CTGGTACRAT 600 

CACACCTCAT ACAAAATTCA AAGCTGAAGT TTATGTTTTG ACAAAAGAAG 650 

AAGGTGGRCG TCACACTCCA TTCTTCACTA ACTACCGTCC TCAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

A 751 



2) INFORMATION FOR SEQ ID NO: 610 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae Rd 

(B) STRAIN: KW20 

(C) ACCESSION NUMBER: extracted from U32739 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610 

AATATGATTA CTGGTGCGGC ACAAATGGAT GGTGCTATTT TAGTAGTAGC 50 

AGCAACAGAT GGTCCTATGC CACAAACTCG TGAACACATC TTATTAGGTC 100 

GCCAAGTAGG TGTTCCATAC ATCATCGTAT TCTTAAACAA ATGCGACATG 150 

GTAGATGACG AAGAGTTATT AGAATTAGTC GAAATGGAAG TTCGTGAACT 200 

TCTATCTCAA TATGACTTCC CAGGTGACGA TACACCAATC GTACGTGGTT 250 

CAGCATTACA AGCGTTAAAC GGCGTAGCAG AATGGGAAGA AAAAATCCTT 300 

GAGTTAGCAA ACCACTTAGA TACTTACATC CCAGAACCAG AACGTGCGAT 350 

TGACCAACCG TTCCTTCTTC CAATCGAAGA TGTGTTCTCA ATCTCAGGTC 400

GTGGTACTGT AGTAACAGGT CGTGTAGAAC GAGGTATTAT CCGTACAGGT 450

GATGAAGTAG AAATCGTCGG TATCAAAGAT ACAGCGAAAA CTACTGTAAC 500 

GGGTGTTGAA ATGTTCCGTA AATTACTTGA CGAAGGTCGT GCAGGTGAAA 550 

ACATCGGTGC ATTATTACGT GGTACCAAAC GTGAAGAAAT CGAACGTGGT 600 

CAAGTATTAG CGAAACCAGG TTCAATCACA CCACACACTG ACTTCGAATC 650 

AGAAGTGTAC GTATTATCAA AAGATGAAGG TGGTCGTCAT ACTCCATTCT 700

TCAAAGGTTA CCGTCCACAA TTCTATTTCC GTACAACAGA CGTGACTGGT 750

ACAATCGAAT TACCAGAAGG CGTGGAAATG GTAATGCCAG GCGATAACAT 800

CAAGATGACA GTAAGCTTAA TCCACCCAAT TGCGATGGAT CAAGGTTTAC 850

GTTTCGCAAT CCGTGAAGGT GGCCGTACAG TAGGTGCAGG C 891



2) INFORMATION FOR SEQ ID NO: 611 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611 

CGGCGGTATC TTAGTTGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 200 

ATGTACCTGT AATCGCTGGT TCTGCATTAA AAGCATTAGA AGGCGATGCT 250 

GAATACGAAC AAAAAATCTT AGACTTAATG CAAGCAGTTG ATGATTACAT 300

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTATGCACGA 450

AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCA 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCTATTAC 600 

ACCACACACA AAATTCAAAG CTGAAGTATA CGTATTATCT AAAGATGAAG 650 

GTGGACGTCA CACTCCATTC TTCACTAACT ATCGCCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG TGTTGTAAAC TTACCAGAAG GTACAGAAAT 750 

GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCAA 800 

TCGCTATCGA AGACGGAA 818 



2) INFORMATION FOR SEQ ID NO: 612 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis serotype Paratyphi A

(B) STRAIN: ATCC 9150 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612 



WO 01/23604 



PCT/CA00/01150 



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAbACCCG 50

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 200

CACGCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATC 300

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 400

GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG 600

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 650

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700

GTACTACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800

CGCAATGGAC GACGGTCTGC GTTTC 825



2) INFORMATION FOR SEQ ID NO: 613 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia ficaria 

(B) STRAIN: ATCC 33105 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613 

GGCGCTATCC TGGTTGTTGC TGCGACTGAC GGCCCAATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GYCAGGTTGG CGTTCCTTTC ATCATCGTRT 100 

TCATGAACAA ATGCGACATG GTTGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGCGAACT GCTGTCCGCT TACGACTTCC CTGGCGATGA 200 

CCTGCCGGTG ATTCGCGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250 

AGTGGGAAGC TAAAATCATC GAGCTGGCTG AMCMSCTGGA TACTTACATC 300

CCAGAACCAG AGCGCGCTAT CGACAAGCCG TTCCTGCTGC CAATCGAAGA 350

CGTATTCTCC ATCTCCGGTC GTGGTACCGT GGTTACCGGT CGTGTTGAGC 400

GCGGTATCAT CAAAGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAC 450 

ACCGTCAAGT CTACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCCATCAAG 600 

CCGCACACCC AGTTCGATTC AGAAGTGTAC ATCCTGAGCA AAGAAGAAGG 650 

TGGTCGTCAC ACKCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACTACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTAATGCCTG GCGACAACGT GAACATGA 778 



2) INFORMATION FOR SEQ ID NO: 614 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 653 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614 



GTGCGATCTT AGTAGTATCA GCTACTGATG GTCCAATGCC TCAAACTCGT 50 

GAACACATTT TGTTATCACG TCAAGTTGGT GTTAAGCACT TGATCGTTTT 100 

CTTGAACAAA GTAGATTTAG TTGATGACGA AGAATTGATC GACTTAGTTG 150

AAATGGAAGT ACGTGAATTA CTTTCTGAAT ATGGTTTCCC AGGTGATGAT 200 

ATTCCAGTGC TTAAAGGTTC TGCTTTGAAA GCATTAGAAG GCGATCCAGA 250 

ACAAGAACAA GTTATTCTTG ATTTGATGGA TACCGTTGAT GAATATATCC 300 

CAACACCTGA ACGTGACAAT GACAAACCGT TCTTGTTACC AGTTGAGGAT 350 

GTTTTCTCGA TCACAGGACG TGGTACTGTA GCTTCTGGTC GTATCGACCG 400

TGGCGAAGTT AAAGTCGGCG ATGAAATTGA AATCATCGGG ATCAAACCTG 450 

AAGTTCAAAA AGCAATCGTT ACTGGACTTG AAATGTTCCG TAAAACATTG 500 

GATTATGGTG AAGCTGGCGA TAACGTTGGG GTTCTATTAC GTGGGATTAC 550 

ACGTGATGAA ATCGAACGTG GCCAAGTATT AGCTAAACCA GGTTCAATCA 600 

CACCACATAC TAAGTTCAAA GCCGAAGTAT ATGTGTTGAC GAAAGAAGAA 650

GGT 653 



2) INFORMATION FOR SEQ ID NO: 615

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus durans 

(B) STRAIN: ATCC 19432 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615 



CCATTCTAGT TGTATCTGCA ACAGATGGAC CAATGCCACA AACACGTGAA 50 

CATATTTTAT TGTCACGTCA AGTAGGTGTT AAATATTTGA TCGTCTTCTT 100 

GAACAAAATC GACTTAGTAG ATGATGAAGA ATTGATTGAT CTTGTCGAAA 150 

TGGAAGTTCG TGAATTATTA AGCGAATATG GTTTCCCAGG TGACGATACA 200 

CCAGTCATCA AAGGTTCAGC ATTAAAAGCT TTACAAGGAG ATCCTGATGC 250

AGAAGCAGCT ATCATGGAAT TGATGGATAC TGTTGATGAA TATATCCCAA 300 

CACCAGAACG TGATACAGAC AAACCATTAT TGTTACCAGT GGAAGATGTC 350 

TTCTCAATCA CAGGTCGTGG GACTGTTGCT TCAGGTCGTA TCGATCGTGG 400 

TGCAGTTCGT GTAGGTGATG AAATCGAAAT CGTCGGTATC AAACCTGAAA 450

CACAAAAAGC TGTTGTAACT GGGGTCGAAA TGTTCCGCAA GACATTAGAC 500

TATGGTGAAG CAGGAGATAA CGTTGGGGTA TTGTTACGTG GTATCCAACG 550 

TGAAGATATC GAACGTGGAC AAGTAATCGC AAAACCAGGT TCAATCACAC 600 

CACATACAAA ATTCAAAGCA GAAGTGTACG TATTGACAAA AGAAGAAGGT 650 

GGACGTCATA CACCATTCTT CAATAACTAT CGTCCACAAT TCTACTTCCG 700 

TACAACTGAC GTAACTGGAA CAATCGTTTT ACCTGGAGGC ACTGAAATGG 750

TTATGCCTGG AGATAACGTA ACGATCGACG TTGAATTGAT CCATCXJAGTT 800

GCCATCGAAA ACGGAACAAC TTTCTCTAT 829 



2) INFORMATION FOR SEQ ID NO: 616 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 669 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus pseudoavium 

(B) STRAIN: ATCC 49372 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616

GGTGCAATTT TAGTAGTATC TGCTACTGAT GGCCCAATGC CACAAACACG 50 

TGAACATATC TTGTTATCAC GTCAAGTAGG GGTTAAACAC TTAATCGTCT 100 

TCTTGAACAA AGTTGATTTA GTTGATGATG AAGAATTGAT CGATTTAGTT 150 

GAAATGGAAG TTCGGGAATT GCTTTCTGAA TATGGTTTCC CAGGCGATGA 200

TATTCCAGTA CTTAAAGGTT CTGCTTTGAA AGCTTTAGAA GGCGATCCTG 250 

AACAAGAACA AGTAATCCTT GACTTGATGG ATACGGTTGA TGAATACATC 300 

CCAACGCCTG AACGTGATAC TGACAAACCA TTCTTGTTAC CAGTCGAAGA 350 

TGTCTTCTCA ATCACAGGAC GTGGTACGGT TGCATCTGGT CGTATCGATC 400

GTGGGGAAGT TAAAGTCGGT GATGAAGTTG AAATCATCGG GATCAAACCT 450

GAAGTGCAAA AAGCTGTCGT AACTGGACTA GAAATGTTCC GTAAGACATT 500 

GGATTACGGT GAAGCTGGCG ATAACGTTGG GGTTCTATTA CGTGGGATTA 550 

CTCGTGATGA AATCGAACGT GGACAAGTAT TAGCTAAACC AGGTTCAATC 600 

ACTCCACATA CGAAATTCAG TGCAGAAGTT TATGTATTGA CGAAAGAAGA 650 

AGGTGGCCGT CATACGCCA 669



2) INFORMATION FOR SEQ ID NO: 617 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus dispar

(B) STRAIN: ATCC 51266 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617 

CGGGGCAATT TTAGTTGTAT CTGCAACTGA TGGCCCAATG CCACAAACAC 50

GTGAACACAT TTTGTTAGCT CGTCAAGTAG GGGTTAAATA TTTAATCGTC 100 

TTCTTGAACA AAACAGATTT AGTTGATGAT GAAGAATTAT TGGAACTAGT 150 

TGAAATGGAA GTTCGTGAAT TATTAAATGA ATACAATTTC CCTGGCGATG 200 

ATATTCCTGT TATTCGCGGA TCTGCTTTAA AAGCATTAGA AGGCGATCCA 250 

GAACAAGAAG AAGTAATTAT GAACTTGATG GATACTGTGG ATGAATATAT 300

CCCAACTCCA GAACGTGACA ATGATAAACC ATTCTTGTTA CCAGTGGAAG 350

ATGTCTTCAC AATTACTGGT CGTGGTACTG TTGCTTCAGG TCGTATCGAC 400

CGTGGTAAAG TCAACGTTGG TGATGAAATT GAAATTATCG GAATTAAACC 450

AGAAACACAA AAAGCTGTTG TAACCGGTTT GGAAATGTTC CGTAAAACTT 500

TGGATTATGG TGAAGCTGGT GATAACGTTG GGGTCTTATT ACGTGGGATT 550

ACTCGTGATG AAGTAGAACG TGGTCAAGTA TTAGCAAAAC CAGGTTCCAT 600

TACACCGCAT ACCAAATTTA AAGGTGAAGT TTATATCTTA ACAAAAGAAG 650

AAGGTGGACG TCATACTCCT TTCTTTAATA ACTATCGTCC TCAATTTTAT 700

TTCCGTACAA CTGATGTGAC TGGTAACATC GCATTACCTG AAGGAACTGA 750

AATGGTAATG CCTGGTGATA ATGTAACAAT TGAAGTTGAA TTGATTCATC 800

CAATCGCCGT TGAAAAAGGG ACTACTTTCT CAATT 835



2) INFORMATION FOR SEQ ID NO: 618

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618 



GGTGCAATCC TAGTAGTATC AGCTACTGAT GGTCCAATGC CGCAAACACG 50 

TGAACATATT TTGCTATCAC GGCAAGTGGG TGTTAAACAC TTAATCGTAT 100 

TTTTAAACAA AGTTGATTTA GTCGATGATG AAGAATTGAT CGATCTAGTT 150 

GAAATGGAAG TCCGTGAATT ACTTTCTGAA TATGGTTTCC CAGGTGACGA 200 

TATTCCAGTT CTCAAAGGTT CAGCTTTGAA AGCATTAGAA GGCGATCCTG 250

AACAAGAACA AGTAATCCTT GATTTAATGG ATACAGTTGA CGAATATATC 300

CCAACTCCAG AACGTGACAC TGACAAGCCA TTCTTGTTAC CAGTCGAAGA 350

TGTATTTTCT ATCACTGGTC GTGGGACTGT AGCGTCTGGA CGGATTGATC 400

GTGGTGAAGT TAAAGTCGGC GATGAAGTTG AAATCATCGG GATCAAACCT 450

GAAATTCAAA AAGCAGTCGT AACTGGACTT GAAATGTTCC GTAAAACTTT 500

AGATTATGGT GAAGCTGGCG ATAACGTTGG GGTTCTATTA CGTGGGATTA 550 

CACGTGATGA AATCGAACGT GGTCAAGTCT TAGCTAAACC AGGTTCAATC 600 

ACACCACATA CAAAATTCAG TGCAGAAGTT TACGTATTGA CGAAAGAAGA 650 

AGGTGGACGT CATACACCAT CTT 673 



2) INFORMATION FOR SEQ ID NO: 619 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1713 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(C) ACCESSION NUMBER: K00428

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619

TTTCATATTT TTAAGGATTT TGTTTTAGCA CCCATCCGAC CTCAGTCAAT 50 

ATATCCTTTC GCGACCAGGC TTTCCTCCCT TTTGCTGCTA ACTGGTTACA 100

GATTTTCCTA TTTTTGGTCA TTTTTATCTT TGAAACTGAT TAAGCTGAAA 150 

AAATTTGAGC TTCTTTGTTG TAAACTATTT TGTGCTTTCA GTTTTATTCT 200 

AGCTCGACAA AGGTAACAGA CAAAAATGTC AGCTTTATTA CCAAGATTAC 250 

TCACAAGAAC AGCTTTTAAA GCTTCTGGGA AACTTCTGAG GCTCTCTTCA 300 

GTAATTTCTA GGACCTTTTC TCAAACTACT ACTTCCTATG CAGCTGCTTT 350

TGATCGTTCC AAACCGCATG TAAATATAGG TACGATCGGC CATGTTGATC 400

ATGGGAAGAC AACTTTAACC GCAGCCATTA CGAAAACGTT AGCCGCAAAA 450 

GGTGGTGCCA ACTTCTTGGA CTATGCTGCC ATCGATAAGG CTCCGGAAGA 500 

AAGAGCTCGT GGTATTACAA TTTCTACTGC ACACGTGGAA TACGAAACGG 550 

CCAAGAGACA TTATTCTCAC GTCGACTGTC CAGGCCACGC TGATTACATC 600

AAGAATATGA TTACCGGTGC TGCTCAAATG GATGGTGCTA TCATTGTTGT 650 

AGCTGCTACC GATGGACAAA TGCCCCAAAC TAGAGAACAT TTACTTTTGG 700 

CCAGACAAGT TGGTGTCCAA CATATTGTCG TTTTTGTTAA CAAGGTTGAT 750 

ACCATTGATG ATCCAGAAAT GTTAGAGTTA GTCGAAATGG AAATGAGAGA 800 

ACTTTTAAAC GAATATGGGT TTGACGGTGA TAATGCTCCA ATTATCATGG 850

GTTCTGCCCT TTGCGCTTTG GAAGGTCGCC AACCTGAAAT TGGGGAGCAG 900 

GCCATCATGA AACTTTTGGA TGCAGTGGAT GAGTATATTC CTACACCTGA 950 

AAGAGATTTG AACAAGCCTT TCTTGATGCC CGTTGAAGAT ATCTTCTCTA 1000 

TCTCCGGTAG AGGTACTGTG GTCACTGGTC GTGTGGAAAG GGGTAATTTA 1050 

AAGAAAGGTG AGGAATTGGA AATTGTTGGT CACAACTCCA CCCCATTGAA 1100

AACAACAGTT ACTGGTATTG AAATGTTTAG AAAGGAATTG GACTCTGCTA 1150

TGGCAGGTGA CAATGCCGGT GTTTTACTTA GAGGTATCAG GAGAGATCAA 1200 

TTGAAGAGAG GTATGGTCTT AGCTAAGCCA GGTACCGTTA AAGCCCATAC 1250 

AAAGATTCTA GCCTCTTTGT ACATTTTATC CAAAGAGGAA GGTGGTAGAC 1300 

ATTCTGGGTT TGGTGAAAAC TACAGACCAC AAATGTTTAT AAGAACAGCT 1350

GATGTTACAG TTGTGATGAG ATTTCCTAAG GAGGTTGAAG ATCATTCTAT 1400

GCAAGTTATG CCAGGTGACA ATGTTGAAAT GGAATGTGAT TTGATCCATC 1450

CTACCCCATT AGAAGTTGGT CAACGTTTCA ATATCAGAGA GGGTGGAAGA 1500 

ACTGTTGGTA CCGGTCTAAT CACACGTATT ATTGAATAGA CTTATTGATG 1550 

CAACTGGAGT ATATTTCTAT ATATTCTGTT CATTTCCCCT CTCATAATAT 1600

ATACTTGTTT CGTTAAAATT TTATACGTGT AAATAAAGTG CCATAAATTT 1650 

TTCAGCTTTA CTTTTGGTAG AGTCCTGCTA GCACTAGATT TTACAATTTC 1700 

ATGTGCACAC ACC 1713 


2) INFORMATION FOR SEQ ID NO: 620 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620 
ATTGGTGCAT TGCTACGT 18 



2) INFORMATION FOR SEQ ID NO: 621 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 751 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621 



TGGTGCAATC TTAGTTGTTT CTGCAACTGA CGGTCCGATG CCTCAAACAC 50

GTGAACACAT TTTATTGTCA CGCCAAGTTG GTGTAAAATA CCTGATTGTT 100

TTCTTGAACA AAGTTGATTT AGTCGATGAT GAAGAATTGA TCGATTTGGT 150

AGAAATGGAA GTTCGCGAGT TATTGAGCGA ATATGGTTTC CCAGGCGATG 200

ACACTCCTGT GATCAAAGGT TCCGCATTAA AAGCATTGCA AGGCGATCCA 250

GATGCTGAAG CTGCTATTAT GGAATTGATG GATACAGTAG ATGAATATAT 300

CCCAACACCA GAACGTGATA CAGATAAACC ATTACTATTG CCAGTGGAAG 350

ACGTCTTCTC AATTACAGGT CGAGGAACTG TTGCCTCAGG TCGTATTGAT 400

CGTGGTGCTG TTCGTGTCGG TGATGAGGTA GAGATCGTAG GGATCAAACC 450

TGAAACACAA AAAGCAGTTG TAACAGGTGT AGAAATGTTC CGTAAAACGT 500

TAGATTACGG GGAAGCTGGG GATAACGTAG GCGTGTTGTT ACGGGGGATC 550

CAACGTGACG ATATCGAACG TGGACAAGTA CTTGCTAAAC CAGGTTCCAT 600

TACTCCACAT ACAAAATTCA AAGCAGAAGT GTACGTGTTG ACAAAAGAAG 650

AAGGTGGACG TCATACTCCA TTCTTCAACA ACTATCGTCC ACAGTTCTAC 700

TTCCGCACAA CTGATGTTAC AGGAACAATC ACATTGCCAG AAGATACAGA 750

A 751



2) INFORMATION FOR SEQ ID NO: 622

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(B) STRAIN: ATCC 13264 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622 



GTCAAATGGG ACGAATCCAG ATTCCAAGAA ATTGTCAAGG AAACCTCCAA 50

CTTTATCAAG AAGGTTGGTT ACAACCCAAA GACTGTTCCA TTCGTCCCAA 100

TCTCTGGTTG GAACGGTGAC AACATGATTG AAGCTACCAC CAACGCTCCA 150

TGGTACAAGG GTTGGGAAAA GGAAACCAAG GCCGGTGTCG TCAAGGGTAA 200

GACTTTGTTG GAAGCCATTG ACGCCATTGA ACAACCATCT AGACCAACTG 250

ACAAGCCATT GAGATTGCCA TTGCAAGATG TTTACAAGAT TGGTGGTATT 300

GGTACTGTGC CAGTCGGTAG AGTTGAAACC GGTGTCATCA AGCCAGGTAT 350

GGTTGTTACT TTCGCCCCAG CTGGTGTTAC CACTGAAGTC AAGTCCGTTG 400

AAATGCATCA CGAACAATTG GAACAAGGTG TTCCAGGTGA CAACGTTGGT 450

TTCAACGTCA AGAACGTTTC CGTTAAGGAA ATCAGAAGAG GTAACGTCTG 500

TGGTGACGCT AAGAACGATC CACCAAAGGG TTGCGCTTCT TTCAACGCTA 550

CCGTCATTGT TTTGAACCAT CCAGGTCAAA TCTCTGCTGG TTACTCTCCA 600

GTTTTGGATT GTCACACTGC TCACATTGCT TGTAGATTCG ACGAATTGTT 650

GGAAAAGAAC GACAGAAGAT CTGGTAAGAA GTTGGAAGAC CATCCAAAGT 700

TCTTGAAGTC CGGTGACGCT GCTTTGGTCA AGTTCGTTCC ATCTAAGCCA 750



2) INFORMATION FOR SEQ ID NO: 623 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1269 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: ATCC 44104 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623 

TCTTGAAAGC TTAAGGCCGA GCGAGAGCGA GGTATCACCA TCGACATTGC 50 

TCTTTGGAAG TTCGAGACCC CCAGGTACCA GGTCACCGTC ATTGACGCCC 100 

CCGGTCACCG AGACTTCATC AAGAACATGA TCACCGGTAC CTCCCAGGCT 150 

GACTGTGCCA TCCTCATCAT TGCCACCGGT ATCGGTGAGT TCGAGGCCGG 200 

TATCTCCAAG GACGGTCAGA CCCGAGAGCA CGCCCTCCTC GCCTTCACCC 250

TCGGTGTCAG GCAGCTCATT GTTGCTTGCA ACAAGATGGA CACCTGCAAG 300 

TGGTCCGAGG ACCGATTCAA CGAAATCGTC AAGGAGACCA ACGGTTTCAT 350 

CAAGAAGGTT GGTTACAACC CCAAGGCTGT CCCCTTCGTC CCCATCTCTG 400

GTTGGCACGG TGACAACATG TTGGAGGAGA CCACCAAGTC AGTGAATCCG 450 

CTTCTACGTG ATGAGATGTT TTTCTGACTT TCCGTGCAGC ATGCCCTGGT 500

ACAAGGGATG GACCAAGGAG ACCAAGTCTG GTGTTTCCAA GGGTAAGACC 550 

CTCCTCGAGG CCATCGACGC CATCGAGCCC CCTACCCGAC CCACCGACAA 600 

GCCCCTCCGT CTCCCTCTCC AGGACGTCTA CAAGATCGGT GGTATCGGCA 650 

CAGTCCCTGT CGGCCGAGTC GAGACCGGTG TCATCAAGGC CGGTATGTTG 700 

TCTCATCTCT CTTGTCTCGT AACATGCGTC TCGTAACATG CGCTTACTTC 750 

ATTTTCAGGT ATGGTCGTCA AGTTCGCCCC CACCAACGTC ACCACTGAAG 800 

TCAAGTCCGT TGAGATGCAC CACGAGCAGA TCCCCGAGGG TCTTCCCGGA 850 

GACAACGTTG GTTTCAACGT CAAGAACGTT TCCATCAAGG ACATCCGACG 900 

AGGTAACGTC TGTGGTGACT CCAAGAACGA CCCCCCTATG GAGGCTGCTT 950 

CTTTCAACGC CCAGGTTATC GTCCTTAACC ACCCTGGTCA GATCGGTGCC 1000 

GGTTACACCC CCGTTCTCGA CTGTCACACT GCCCGTAAGC CTGACCCAAT 1050 

ACCTCCAACA TACCTTTGAA GCTGACCCTT TCTAGACATT GCCTGCAAGT 1100 

TTGCTGAGTT GATCGAGAAG ATTGACCGAC GAACCGGTAA GGTCATGGAG 1150 

GCCGCCCCCA AGTTCGTCAA GTCTGGTGAC GCCGCCATTG TCAAGCTTGT 1200 

TGCCCAGAAG CCCCTCTGTG TTGAGACCTA CGCCGACTAC CCCCCTCTTG 1250 

GTCGATTCGC CGTCCGAGA 1269 



2) INFORMATION FOR SEQ ID NO: 624 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 36801 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624 

TCTGTCAAAT GGGACAAAAA CAGATTTGAA GAAATCATCA AGGAAACCTC 50 

CAACTTCGTC AAGAAGGTTG GTTACAACCC AAAGACTGTT CCATTCGTTC 100 

CAATCTCTGG TTGGAATGGT GACAACATGA TTGAACCATC CACCAACTGT 150

CCATGGTACA AGGGTTGGGA AAAGGAAACC AAATCCGGTA AAGTTACTGG 200 

TAAGACCTTG TTAGAAGCTA TTGACGCTAT TGAACCACCA ACCAGACCAA 250 

CCGACAAACC ATTGAGATTG CCATTGCAAG ATGTTTACAA GATCGGTGGT 300 

ATTGGTACTG TGCCAGTCGG TAGAGTTGAA ACTGGTATCA TCAAAGCCGG 350

TATGGTTGTT ACTTTCGCCC CAGCTGGTGT TACCACTGAA GTCAAATCCG 400

TTGAAATGCA TCACGAACAA TTGGCTGAAG GTGTTCCAGG TGACAATGTT 450 

GGTTTCAACG TTAAGAACGT TTCCGTTAAA GAAATTAGAA GAGGTAACGT 500 

TTGTGGTGAC TCCAAGAACG ATCCACCAAA GGGTTGTGAC TCTTTCAATG 550 

CCCAAGTCAT TGTTTTGAAC CATCCAGGTC AAATCTCTGC TGGTTACTCT 600 

CCAGTCTTGG ATTGTCACCC TGCCCACATT GCTTGTAAAT TCGACACTTT 650

GGTTGAAAAG ATTGACAGAA GAACTGGTAA GAAATTGGAA GAAAATCCAA 700 

AATTCGTCAA ATCCGGTGAT GCTGCTATCG TCAAGATGGT CCCAACCAAA 750 

CCA 753 



2) INFORMATION FOR SEQ ID NO: 625 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625 
CGTTGAAGAC ACGACCCAAA GTATCC 26 


2) INFORMATION FOR SEQ ID NO: 626 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626
TACCACCTTT TAAGTAAGGT GCTAAT 26



2) INFORMATION FOR SEQ ID NO: 627 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627 
ATTGTCTATA AAAATGGCGA TAAGTC 



2) INFORMATION FOR SEQ ID NO: 628

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628 
AAAATGGCGA TAAGTCACAA AAAGTA 



2) INFORMATION FOR SEQ ID NO: 629 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629 
AAGTTCCATC TCAACAAGGT CAATA 



2) INFORMATION FOR SEQ ID NO: 630 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630 
CGGAGCTATC CTAGTCGTTT CA 



2) INFORMATION FOR SEQ ID NO: 631 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631 
CAGACCAACY GAIAARCCAT TRAGAT 26



2) INFORMATION FOR SEQ ID NO: 632 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632 

CCCTTTGGTG GRTCSTKCTT GGA 23 



2) INFORMATION FOR SEQ ID NO: 633

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633

CAGACCAACY GAIAARCCIT TRAGAT 26



2) INFORMATION FOR SEQ ID NO: 634 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634

AACACYGTCA GRRCIATTGC YATGGA 26



2) INFORMATION FOR SEQ ID NO: 635 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635 
AAACCRGTIA RRGCRACTCT IGCTCT 26 


2) INFORMATION FOR SEQ ID NO: 636 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636 
ACTGGYGTTG AIATGTTCCG YAA 23



2) INFORMATION FOR SEQ ID NO: 637 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637 

ACGTCAGTIG TACGGAARTA GAA 23



2) INFORMATION FOR SEQ ID NO: 638

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638 
CCAATGCCAC AAACICGTGA RCACAT 



2) INFORMATION FOR SEQ ID NO: 639 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639 
TTTACGGAAC ATTTCWACAC CWGTIACA 



2) INFORMATION FOR SEQ ID NO: 640

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640 
TCCATGGTIT WYGGICARAT GAA 



2) INFORMATION FOR SEQ ID NO: 641 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641 
TGATAACCWA CIGCIGAIGG CATACG 



2) INFORMATION FOR SEQ ID NO: 642

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642 
GGCGTIGGIG ARCGIACICG TGA 



2) INFORMATION FOR SEQ ID NO: 643

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643 
ACTGGIGTIG ARATGTTCCG YAA 



2) INFORMATION FOR SEQ ID NO: 644 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644 
ACGTCIGTIG TICKGAARTA GAA 



2) INFORMATION FOR SEQ ID NO: 645 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645
ACGTCIGTIG TICKGAARTA RAA 



2) INFORMATION FOR SEQ ID NO: 646

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646 
ATCGACAAGC CITTCYTIAT GSC 



2) INFORMATION FOR SEQ ID NO: 647 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647 
ACGTCCGTSG TRCGGAAGTA GAACTG 



2) INFORMATION FOR SEQ ID NO: 648 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 648
ACGTCSGTSG TRCGGAAGTA GAACTG 



2) INFORMATION FOR SEQ ID NO: 649 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649

GTCCTATGCC TCARACWCGI GAGCAC 26 



2) INFORMATION FOR SEQ ID NO: 650 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650 
TTACGGAACA TYTCAACACC IGT 23 


2) INFORMATION FOR SEQ ID NO: 651 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651 
TGACGACCAC CITCYTCYTT YTTCA 25



2) INFORMATION FOR SEQ ID NO: 652 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652 

CCWAYAGTIY KICCICCYTC YCTIATA 27 



2) INFORMATION FOR SEQ ID NO: 653

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 653 
GAYTTCATIA ARAAYATGAT 



2) INFORMATION FOR SEQ ID NO: 654 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654 
TACAARATYK GIGGTATYGG 



2) INFORMATION FOR SEQ ID NO: 655 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655 
CCRATACCIC MRATYTTGTA 



2) INFORMATION FOR SEQ ID NO: 656 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 656 
AATTAATGGC TGCAGTTGAY GA 



2) INFORMATION FOR SEQ ID NO: 657 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 657 
TTGTCCACGT TCGATRTCTT CA 



2) INFORMATION FOR SEQ ID NO: 658 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658 
GATYTAGTCG ATGATGAAGA ATT 



2) INFORMATION FOR SEQ ID NO: 659 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659 
GCTTTTTGIG TTTCWGGTTT RAT 



2) INFORMATION FOR SEQ ID NO: 660 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 660 
GTAGAATTGA GGACGGTAGT TAG 



2) INFORMATION FOR SEQ ID NO: 661

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 661 
GTAGAAYTGT GGWCGATART TRT 



2) INFORMATION FOR SEQ ID NO: 662 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 662 

CGGCGCAATC CTCGTTGTTG CTGCCACCGA CGGCCCAATG CCTCAGACCC 50 

GTGAGCACGT TCTGCTCGCT CGCCAGGTCG GCGTTCCTTA CATCCTCGTT 100 

GCTCTGAACA AGTGCGACAT GGTTGATGAT GAGGAAATCA TCGAGCTCGT 150 

CGAGATGGAG ATCCRTGAGC TGCTCGCTGA GCAGGATTAC GACGAAGAGG 200 

CTCCAATCAT CCACATCTCC GCACTGAAGG CTCTTGAGGG CGACGAGAAG 250 

TGGACCCAGT CCATCATCGA CCTCATGCAG GCTTGCKATG ATTCCATCCC 300 

AGACCCAGAG CGTGAGACCG ACAAGCCATT CCTCATGCCT ATCGAGGACA 350

TCTTCACCAT CACCGGCCGC GGTACCGTTG TTACCGGCCG TGTTGAGCGT 400

GGCTCCCTGA AGGTCAACGA GGACGTCGAG ATCATCGGTA TCCGCGAGAA 450

KGCTACCACC ACCACCGTTA CCGGTATCGA GATGTTCCGT AAGCTTCTCG 500 

ACTACACCGA GGCTGGCGAC AACTGTGGTC TGCTTCTCCG TGGCGTTAAG 550 

CGCGAAGACG TTGAGCGTGG CCAGGTTGTT GTTAAGCCAG GCGCTTACAC 600 

CCCTCACACC GAGTTCGAGG GCTCTGTCTA CGTTCTGTCC AAGGACGAGG 650 

GTGGCCGCCA CACCCCATTC TTCGACAACT ACCGCCCACA GTTCTACTTC 700 

CGCACCACCG ACGTTACCGG TGTTGTGAAG CTTCCTGAGG GCACCGAGAT 750 

GGTCATGCCT GGCGACAACG TCGACATGTC CGTCACCCTG ATCCAGCCTG 800 

TCGCTATGGA TGAGGGCCTG CGCTTCGCTA TC 832 



2) INFORMATION FOR SEQ ID NO: 663 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1192 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Candida catenulata 

(B) STRAIN: ATCC 10565 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663 

AACGGCGAGC AAGACTTGGT GTTGGAGGTT TCTCAGCACT TGGGTGAGAA 50 

CACCGTGCGT ACCATTGCCA TGGACGGTAC CGAGGGTTTG GTGAGAGGTA 100 

CCGCTGTCAC TGACACCGGT GCTCCCATCT CGGTCCCCGT TGGTCAGGGT 150 

ACCTTGGGCC GGATCATCAA CGTTGTCGGT GAGCCCATCG ACGAGCGTGG 200

TCCCATCCAG TGCAAGCAGA GAAACCCCAT TCACGCCGAG CCCCCGTCTT 250 

TCACCGAGCA GTCCGTCGAG GCTGAGGTGT TGGAGACCGG TATCAAGGTT 300 

GTCGACTTGT TGGCTCCCTA CGCCCGTGGT GGTAAGATTG GTCTTTTCGG 350 

TGGTGCCGGT GTCGGTAAGA CCGTGTTCAT CCAGGAGTTG ATTAACAACA 400

TTGCCAAGGC CCACGGTGGT TTCTCCGTGT TCACTGGTGT CGGTGAGCGT 450 

ACTCGTGAGG GTAACGACTT GTACCGTGAA ATGAAGGAGA CCGGTGTCAT 500 

CAACTTGGAG GGCGACTCCA AGGTGGCCTT GGTGTTCGGT CAGATGAACG 550 

AGCCCCCGGG GGCTCGTGCC CGTGTCGCCT TGACCGGTTT GACCATTGCC 600 

GAGTACTTCC GTGACGAGGA AGGCCAGGAC GTGTTGTTGT TCGTTGACAA 650 

CATTTTCAGA TTCACCCAGG CCGGTTCCGA GGTGTCGGCG TTGTTGGGTC 700 

GTATCCCCTC CGCCGTCGGT TACCAGCCCA CTTTGGCCAC CGACATGGGT 750 

TTGTTGCAGG AGAGAATTAC CACCACCAAG AAGGGTTCCG TCACCTCTGT 800 

GCAGGCCGTG TACGTCCCTG CCGATGACTT GACTGACCCT GCCCCCGCCA 850 

CCACTTTCGC TCACTTGGAC GCCACCACCG TGTTGTCGCG TGGTATCTCC 900 

GAGTTGGGTA TCTACCCCGC CGTCGACCCC TTGGACTCCA AGTCGAGATT 950 

GTTGGACGTC GAGGTTGTTG GCCAGGAGCA CTACGACGTC GCCACCGGTG 1000 

TCCAGGAGTG CTTGCAGGCC TACAAGTCGT TGCAGGACAT CATTGCCATT 1050 

TTGGGTATGG ACGAGTTGTC CGAGCAGGAC AAGTTGACCG TCGAGAGAGC 1100 

CCGTAAGATC CAGCGTTTCT TGTCGCAGCC CTTCGCTGTC GCCGAGGTTT 1150 

TCACTGGTAT CCCCGGTAGA TTGGTGAGAT TGCAGGACAC CG 1192 



2) INFORMATION FOR SEQ ID NO: 664 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 664 
AAYATGATIA CIGGIGCIGC ICARATGGA



2) INFORMATION FOR SEQ ID NO: 665 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(C) ACCESSION NUMBER: extracted from X00779 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 665 

ATGGGTAAAG AGAAGTCTCA CATTAACGTT GTCGTTATCG GTCATGTCGA 50 

TTCTGGTAAG TCTACCACTA CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAA AAGTTCGAAA AGGAAGCCGC TGAATTAGGT 150 

AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG 200 

AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 250 

AGTACCAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 300

AACATGATTA CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 

TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAACCA 400

GAGAACACGC TTTGTTGGCT TTCACCTTGG GTGTTAGACA ATTGATTGTT 450 

GCTGTCAACA AGATGGACTC CGTCAAATGG GACGAATCCA GATTCCAAGA 500 

AATTGTCAAG GAAACCTCCA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 

AGACTGTTCC ATTCGTCCCA ATCTCTGGTT GGAACGGTGA CAACATGATT 600 

GAAGCTACCA CCAACGCTCC ATGGTACAAG GGTTGGGAAA AGGAAACCAA 650 

GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGCCATT GACGCCATTG 700 

AACAACCATC TAGACCAACT GACAAGCCAT TGAGATTGCC ATTGCAAGAT 750 

GTTTACAAGA TTGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC TTTTGCCCCA GCTGGTGTTA 850 

CCACTGAAGT CAAGTCCGTT GAAATGCATC ACGAACAATT GGAACAAGGT 900 

GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 950 

AATCAGAAGA GGTAACGTCT GTGGTGACGC TAAGAACGAT CCACCAAAGG 1000 

GTTGCGCTTC TTTCAACGCT ACCGTCATTG TTTTGAACCA TCCAGGTCAA 1050 

ATCTCTGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 1100 

TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA 1150 

AGTTGGAAGA CCATCCAAAG TTCTTGAAGT CCGGTGACGC TGCTTTGGTC 1200

AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

ACCATTAGGT AGATTCGCTG TCAGAGACAT GAGACAAACT GTCGCTGTCG 1300 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGCCGCTAA GGTTACCAAG 1350 

GCTGCTCAAA AGGCTGCTAA GAAATAA 1377 



2) INFORMATION FOR SEQ ID NO: 666 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1536 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(C) ACCESSION NUMBER: extracted from M12082 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 666 

ATGGTTTTGC CAAGACTATA TACTGCTACA TCCCGTGCTG CTTTTAAAGC 50 

AGCCAAACAA TCCGCTCCGC TTCTATCCAC TTCGTGGAAA AGATGTATGG 100 

CCTCAGCTGC TCAATCTACT CCAATCACCG GTAAAGTTAC CGCTGTCATT 150 

GGTGCCATTG TTGACGTTCA TTTTGAACAA TCAGAGTTGC CCGCTATTTT 200 

GAACGCTTTA GAAATTAAAA CACCTCAAGG TAAGTTGGTT TTGGAAGTTG 250

CTCAACATTT GGGTGAAAAC ACTGTCAGAA CCATTGCTAT GGATGGTACC 300

GAAGGTTTGG TCCGTGGTGA AAAGGTTCTT GACACTGGTG GCCCTATCTC 350

CGTCCCAGTT GGGAGAGAAA CTTTAGGGAG AATCATCAAC GTTATCGGTG 400

AACCTATTGA TGAAAGAGGT CCAATTAAGT CCAAACTAAG AAAGCCAATT 450

CACGCAGACC CTCCTAGTTT TGCAGAACAA TCTACTTCGG CTGAAATTTT 500

GGAAACAGGT ATCAAAGTCG TCGATCTATT AGCTCCTTAT GCCAGAGGTG 550

GTAAGATTGG TCTTTTCGGT GGTGCAGGTG TCGGTAAGAC TGTGTTCATT 600

CAAGAATTGA TTAACAATAT CGCCAAGGCC CATGGTGGTT TTTCCGTTTT 650

CGCCGGTGTT GGTGAAAGGA CCAGAGAGGG TAATGACTTG TACCGTGAAA 700

TGAAGGAAAC TGGAGTCATT AACTTGGAAG GTGAATCCAA GGTCGCCTTA 750

GTTTTCGGTC AAATGAACGA ACCTCCAGGA GCCAGAGCCA GAGTCGCTTT 800

AACTGGTTTG ACGATCGCTG AATATTTCAG AGATGAAGAA GGTCAAGACG 850

TCTTGTTGTT TATCGACAAT ATCTTTAGAT TTACTCAAGC TGGTTCAGAA 900

GTCTCTGCCC TTTTGGGTCG TATTCCATCT GCCGTCGGTT ATCAACCAAC 950

TTTGGCCACT GATATGGGTC TCTTACAAGA AAGAATTACC ACCACAAAGA 1000

AGGGTTCTGT CACTTCTGTG CAAGCCGTTT ATGTTCCAGC CGATGATTTA 1050

ACAGATCCGT CTCCGTCCAC ATCTTTTGCC CATTTGGACG CATCATCCGT 1100

CTTGTCAAGA GGTATTTCAG AATTAGGTAT TTACCCTGCA GTGGATCCAT 1150

TGGATTCTAA ATCAAGGTTA TTGGATGCCG CCGTTGTCGG TCAAGAACAT 1200

TATGACGTCG CCTCCAAGGT TCAAGAAACT TTACAGACCT ATAAATCTTT 1250

ACAAGATATC ATTGCTATTT TGGGTATGGA TGAATTGTCC GAACAAGATA 1300

AACTAACTGT CGAAAGGGCA AGAAAGATTC AAAGATTCTT ATCTCAACCA 1350

TTTGCTGTCG CCGAAGTCTT TACTGGTATC CCAGGTAAAT TAGTGAGATT 1400

AAAGGACACC GTTGCCTCGT TCAAAGCCGT TTTGGAAGGT AAATACGATA 1450

ATATACCAGA ACATGCTTTC TATATGGTTG GTGGTATTGA AGATGTTGTT 1500

CGTAAAGCTG AAAAGTTAGC CCGTGAAGCC AACTAG 1536



30 



40 



2) INFORMATION FOR SEQ ID NO: 667 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1293 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: Y 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667

CGGAAGGCGT GCCGCCGGTG TTAACTGCTT TGGATGTGGT GGAAAAACTT 50 

GGCCGTGACG AGCCCTTGAC ACTTGAAATT GTACAGCACT TGGACGCCCA 100 

CACCGGCCGT TGCATTGCGA TGCAGACGAC AGATCTGCTT AAGCTGAAAT 150 

CGAAGGTAGT CTCGAGTGGC GGGAACATTT CTGTGCCTGT GGGTCGGGAG 200

ACACTTGGCA GAATCTTCAA TGTGCTGGGC GATGCCATTG ACCAGCGCGG 250 

CCATGTTGGA GAGAAGCAAC GCATGCCTAT TCACGCTGTG GCACCAAAGT 300 

TGGCGGATCA GGCGGCAGAG GACACCATTC TCACAACGGG TATCAAGGTG 350 

ATTGATCTGA TTTTGCCCTA CTGCAAGGGT GGGAAGATTG GTCTTTTTGG 400

TGGTGCTGGC GTGGGCAAGA CGGTTATTAT CATGGAACTT ATTAACAACG 450

TTGCCAAGGG CCATGGTGGG TTCTCCGTCT TTGCGGGTGT TGGTGAACGC 500 

ACGCGTGAGG GGACGGATCT TTACCTGGAG ATGATGCAAT CCAAAGTTAT 550 

TGACCTGAAG GGTGACTCGA AGTGTGTGTT GGTGTATGGT CAGATGAACG 600 

AGCCTCCGGG TGCCCGTGCT CGTGTGGCGC AGTCTGCCTT GACAATGGCC 650 

GAGTACTTCC GTGATGTGGA AGGGCAAGAC GTGTTGCTTT TTATCGACAA 700

CATTTTCCGC TTTACGCAGG CAAACTCTGA GGTGTCAGCG CTGCTGGGTC 750

GTATTCCCGC TGCCGTCGGC TACCAGCCTA CCCTTGCTGA GGATCTTGGG 800

CAGTTGCAGG AGCGCATTAC GTCCACGACA AAAGGTTCCA TTACCTCTGT 850 

GCAGGCTGTG TACGTGCCAG CCGATGATAT TACCGACCCT GCGCCAGCAA 900 

CGACCTTTTC CCACCTCGAT GCCACGACGG TGCTGGACCG TGCCGTTGCC 950 

GAATCCGGCA TTTACCCCGC TGTCAACCCA CTGGAGTGTG CCTCGCGTAT 1000 

CATGGACCCG GATGTAATCA GCGTTGACCA CTACAACGTG GCGCAGGATG 1050 

TGGTGCAGAT GCTTACCAAG TACAAGGAGC TGCAGGATAT CATTGCGGTG 1100 

CTTGGCATTG ATGAGCTCAG TGAGGAGGAT AAACTTATCG TGGACCGTGC 1150 

GCGTAAGGTG ACAAAGTTTC TCTCCCAGCC TTTCCAGGTG GCGGAGGTGT 1200

TTACTGGCAT GACAGGCCAC TACGTGCAGC TGGAGGAGAC CATTGAGTCT 1250 

TTTTCCGGCC TGTTGATGGG CACATATGAT CAGGTGCCGG AGA 1293



2) INFORMATION FOR SEQ ID NO: 668 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1191 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium glutamicum 
(C) ACCESSION NUMBER: X77034 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 668 

GTGGCAAAGG CGAAGTTCGA GCGTACCAAG CCCCACGTAA ACATCGGCAC 50 

CATCGGTCAC GTTGACCACG GTAAGACCAC CACCACCGCG GCTATCACCA 100 

AGGTTCTGGC TGACACTTAC CCTGAGCTCA ACGAGGCTTT CGCCTTCGAC 150 

TCCATCGATA AGGCTCCTGA GGAGAAGGAG CGTGGCATCA CGATCAACAT 200

CTCCCACGTT GAGTACCAGA CTGAAAAGCG CCACTACGCA CACGTTGACG 250 

CTCCAGGCCA CGCCGACTAC ATCAAGAACA TGATTACCGG CGCTGCTCAG 300 

ATGGACGGCG CAATCCTCGT TGTTGCTGCT ACCGACGGCC CAATGCCTCA 350 

GACCCGTGAG CACGTTCTTC TTGCTCGCCA GGTTGGCGTT CCTTACATCC 400 

TCGTTGCTCT TAACAAGTGC GACATGGTTG AGGATGAGGA AATCATCGAG 450 

CTCGTCGAGA TGGAAGTTCG TGAACTTCTT GCTGAGCAGG ACTACGACGA 500 

AGAGGCTCCA ATTGTTCACA TCTCCGCTCT GAAGGCTCTT GAGGGCGACG 550 

AGAAGTGGGG CAAGCAGATC CTTGAGCTCA TGCAGGCTTG CGATGACAAC 600 

ATCCCTGACC CAGTTCGTGA GACCGACAAG CCATTCCTCA TGCCTATCGA 650 

GGACATCTTC ACCATCACCG GTCGTGGCAC CGTTGTTACC GGTCGTGTTG 700 

AGCGCGGTAC CCTGAACGTG AACGATGATG TTGACATCAT CGGCATCAAG 750 

GAGAAGTCCA CCTCCACCAC CGTTACCGGT ATCGAGATGT TCCGTAAGCT 800 

TCTTGACTCC GCTGAGGCTG GCGACAACTG TGGTCTGCTT CTCCGTGGTA 850 

TCAAGCGCGA AGATGTTGAG CGTGGCCAGG TTATCGTTAA GCCAGGCGCT 900 

TACACCCCTC ACACCGAGTT CGAGGGCTCT GTCTACGTTC TGTCCAAGGA 950 

TGAAGGTGGC CGCCACACCC CATTCTTCGA CAACTACCGT CCTCAGTTCT 1000 

ACTTCCGCAC CACCGACGTT ACCGGTGTTG TGAAGCTTCC AGAGGGCACC 1050 

GAGATGGTCA TGCCTGGCGA CAACGTCGAC ATGTCCGTCA CCCTGATCCA 1100 

GCCTGTCGCT ATGGACGAGG GCCTGCGTTT CGCTATCCGC GAAGGCTCCC 1150 

GCACCGTTGG CGCTGGTCGT GTCACCAAGA TCATCAAGTA A 1191 



2) INFORMATION FOR SEQ ID NO: 669

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1383 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Escherichia coli

(C) ACCESSION NUMBER: extracted from V00267

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669

ATGGCTACTG GAAAGATTGT CCAGGTAATC GGCGCCGTAG TTGACGTCGA 50

ATTCCCTCAG GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTGCAAA 100

ATGGTAATGA GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGT 150

ATCGTACGTA CCATCGCAAT GGGTTCCTCC GACGGTCTGC GTCGCGGTCT 200

GGATGTAAAA GACCTCGAAC ACCCGATTGA AGTCCCGGTA GGTAAAGCGA 250

CTCTGGGCCG TATCATGAAC GTACTGGGTG AACCGGTCGA CATGAAAGGC 300

GAGATCGGTG AAGAAGAGCG TTGGGCGATT CACCGCGCAG CACCTTCCTA 350

CGAAGAGCTG TCAAACTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA 400

TCGACCTGAT GTGTCCGTTC GCTAAGGGCG GTAAAGTTGG TCTGTTCGGT 450

GGTGCGGGTG TAGGTAAAAC CGTAAACATG ATGGAGCTCA TTCGTAACAT 500

CGCGATCGAG CACTCCGGTT ACTCTGTGTT TGCGGGCGTA GGTGAACGTA 550

CTCGTGAGGG TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC 600

GACAAAGTAT CCCTGGTGTA TGGCCAGATG AACGAGCCGC CGGGAAACCG 650

TCTGCGCGTT GCTCTGACCG GTCTGACCAT GGCTGAGAAA TTCCGTGACG 700

AAGGTCGTGA CGTTCTGCTG TTCGTTGACA ACATCTATCG TTACACCCTG 750

GCCGGTACGG AAGTATCCGC ACTGCTGGGC CGTATGCCTT CAGCGGTAGG 800

TTATCAGCCG ACCCTGGCGG AAGAGATGGG CGTTCTGCAG GAACGTATCA 850

CCTCCACCAA AACTGGTTCT ATCACCTCCG TACAGGCAGT ATACGTACCT 900

GCGGATGACT TGACTGACCC GTCTCCGGCA ACCACCTTTG CGCACCTTGA 950

CGCAACCGTG GTACTGAGCC GTCAGATCGC GTCTCTGGGT ATCTACCCGG 1000

CCGTTGACCC GCTGGACTCC ACCAGCCGTC AGCTGGACCC GCTGGTGGTT 1050

GGTCAGGAAC ACTACGACAC CGCGCGTGGC GTTCAGTCCA TCCTGCAACG 1100

TTATCAGGAA CTGAAAGACA TCATCGCCAT CCTGGGTATG GATGAACTGT 1150

CTGAAGAAGA CAAACTGGTG GTAGCGCGTG CTCGTAAGAT CCAGCGCTTC 1200

CTGTCCCAGC CGTTCTTCGT GGCAGAAGTA TTCACCGGTT CTCCGGGTAA 1250

ATACGTCTCC CTGAAAGACA CCATCCGTGG CTTTAAAGGC ATCATGGAAG 1300

GCGAATACGA TCACCTGCCG GAGCAGGCGT TCTACATGGT CGGTTCCATC 1350

GAAGAAGCTG TGGAAAAAGC CAAAAAACTT TAA 1383



2) INFORMATION FOR SEQ ID NO: 670 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1410 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(B) STRAIN: NCTC 11638 

(C) ACCESSION NUMBER: extracted from AF004014 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 670 

ATGAAAGCGA TGGAAGGTAA AATCATTCAG GTTTTAGGCC CGGTGGTAGA 50 

TGTGGAGTTT GAATCCTATC TGCCGGCGAT TTTTGAAGCA CTAGACATTA 100 

ATTTTGAAGT TAATGGCGTT CAAAAATCTT TAGTTTTAGA GGTGGCAGCC 150

CATTTGGGTG GTAATCGGGT GCGAGCGATT GCTATGGATA TGACAGAAGG 200 

CTTAGTGCGT AACCAAGCCG TCAAAGCTCG TGGCAAAATG ATTGAAGTGC 250 

CTGTGGGCGA AGAAGTGTTA GGGCGTATTT TTAATGTTGT GGGCGAGAGC 300 

ATTGATAATT TAGAGCCGCT TAAGCCGTCC TTAACTTGGC CCATTCACAG 350 

AAAAGCCCCT AGTTTTGAGC AGCAAAGCAC TAAAACAGAA ATGTTTGAAA 400

CCGGTATTAA AGTCATTGAC TTGCTCGCGC CTTATTCTAA GGGCGGTAAA 450 

GTAGGCTTGT TTGGTGGGGC TGGCGTAGGC AAAACGGTGA TCATTATGGA 500 

GCTTATCCAC AATGTGGCTT ATAAGCATAA CGGGTATTCG GTGTTTGCAG 550 

GTGTGGGGGA GCGCACCAGA GAAGGGAACG ATCTGTATTT TGAGATGAAA 600 

GAAGGGGGCG TTTTAGACAA AGTTGCGTTG TGCTATGGGC AAATGAATGA 650

GCCACCAGGT GCAAGGAATC GCATCGCATT CACCGGCTTG ACGATGGCGG 700 

AGTATTTCCG TGATGAAAAG GGCTTAGATG TGTTGATGTT TATTGATAAC 750 

ATCTTTAGAT ACGCTCAAAG CGGTGCGGAA ATGAGCGCGC TATTAGGCCG 800 

TATCCCTTCA GCGGTGGGGT ATCAGCCCAC GCTAGCCGGG GAAATGGGGA 850 

AACTTCAAGA GCGTATCGCT TCCACTAAAA ATGGCTCTAT CACTTCGGTT 900

CAAGCGGTGT ATGTGCCAGC AGACGACTTG ACTGACCCAG CCCCTGCTTC 950 

GGTGTTTGCG CATTTAGATG CGACTACGGT GTTGAATAGA AAGATCGCTG 1000 

AAAAAGGGAT TTATCCGGCG GTGGATCCTT TGGATTCCAC TTCAAGGATT 1050 

TTAAGCCCTC AAATGATTGG CGAGAAGCAC TATGAAATCG CCACCGGTAT 1100 

CCAGCAAGTT TTGCAAAAAT ACAAGGATTT GCAAGATATT ATTGCGATTT 1150

TGGGATTGGA CGAATTGAGC GAAGAGGATA AAAAAACGGT TGAAAGGGCC 1200 

AGAAAAATTG AGAAGTTTTT ATCCCAGCCG TTTTTTGTGG CTGAAGTGTT 1250 

TACAGGAAGT CCCGGTAAGT ATGTGACTCT CCAAGAGACT TTAGAGGGCT 1300 

TTGGAGGGAT TTTAGAGGGC AAATACGATC ACATTCCTGA AAACGCGTTT 1350 

TACATGGTGG GCAGCATTCA AGAGGTTTTA GAAAAAGCTA AAAACATGAA 1400

AAATTCCTAA 1410 



2) INFORMATION FOR SEQ ID NO: 671

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1401 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Clostridium acetobutylicum

(B) STRAIN: DSM 792 

(C) ACCESSION NUMBER: extracted from AF101055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671

ATGCCAGAAC ATGTAGGTAA AATTGTTCAG GTAATAGGAC CTGTTGTGGA 50 

TATTAAATTT GATGCAGAGA ACCTTCCTGA CATCTATAAT TCCATAGAAA 100 

TAGATATGGG AGATAATAAA AAACTCATTG CTGAAGTTGA ACAACATGTA 150 

GGAGATGACA TAGTAAGAAC AATAGCAATG GAAGGTACTG ACGGATTAAA 200

AAGAGGAATG GAAGCAGTTA ACACTGGTAA ACCAATATCT GTACCAGTTG 250 

GAGAAAATGT TTTAGGACGT CTTTTTAATG TTTTAGGTCA GACAATAGAT 300 

GAAGCAGGAG ACATGAATGC TGATAAGTAT TATCCAATTC ATAGACCAGC 350 

TCCAACCTTT GAAGAACAAT CAGTTCAACC AGAAATGTTT GAAACAGGTA 400

TTAAGGTTAT AGATTTACTT GCTCCATATC AAAAGGGTGG AAAAATCGGT 450

TTGTTCGGTG GTGCCGGTGT TGGTAAAACA GTTCTTATTC AGG£ACTTAT 500

AAATAATATA GCAAAAGAAC ACGGTGGATT ATCAGTATTC ACAGGTGTTG 550

GAGAAAGAAC AAGAGAAGGT AATGACCTTT ATTATGAAAT GAAAGATTCA 600

GGAGTTATAA ATAAAACAGC TCTAGTATTT GGTCAGATGA ATGAACCACC 650

TGGCGCTAGA ATGAGAGTTG CTTTAACAGG ACTTACAATG GCTGAATATT 700

TTAGAGACAA AGGTCAAGAT GTGCTTCTAT TTATAGATAA TATATTCAGA 750

TTTACACAAG CTGGTTCAGA GGTTTCAGCT TTACTTGGTA GAATACCTAG 800

TGCCGTTGGT TATCAGCCAA CACTTGCAAA TGAAATGGGT GCTCTTCAAG 850

AGAGAATAAC ATCAACAAAA CAGGGTTCAA TCACATCCGT TCAGGCTGTA 900

TATGTTCCTG CTGATGACCT TACAGACCCA GCTCCAGCAA CAACATTTAC 950

GCATCTTGAT GCAACAACAG TTCTTTCAAG AGAAATATCA AACTTAGGAA 1000

TATATCCTGC AGTTAGTCCT CTTGAATCAA CTTCAAGAAT ACTTGATCCA 1050

AGAATTGTTG GAGAAGAGCA TTATGAAGTT GCTAACAAGG TTAAACATAT 1100

ACTTGAAAGA TATCAAGAAC TTCAAGATAT CATAGCTATA CTTGGTGTTG 1150

ATGAACTTTC AGATGAGGAT AGATTGTTAG TTGGAAGAGC AAGAAGAGTA 1200

CAGAGATTCT TATCTCAAGC TTTTAGTGTT GCTGAACAAT TTACAGGAAT 1250

GAAAGGTCAG TTTGTACCTG TAAAAGATAC TATAAGAAGT TTTAAAGAAA 1300

TATTAGATGG TAAGTGTGAT GATCTTCCAG AAGCTGCATT TTTATTTGCA 1350

GGAACAATAG AAGATGTAAA AGAAAAAGCT AAAAAAATGA TGGAAAGCTA 1400

A 1401



2) INFORMATION FOR SEQ ID NO: 672 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1509 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Cytophaga lytica

(B) STRAIN: DSM 2039 

(C) ACCESSION NUMBER: M22535



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 672 



ATGTCTAAAG TTACAGGTAA AGTTTCCCAA ATTATTGGCC CAGTTATAGA 50 

TGTGGAGTTC CAAGCAGGGG TAGATCTTCC AAAAATTTAT GATTCATTAG 100 

AAATTAAAAA AGCAGATGGA TCAATTTTGG TTTTGGAAGT ACAATCACAC 150 

ATTGGTGAGA ACACAGTAAG AACTATATCT ATGGATTCTT CTGATGGTTT 200 

AAGTCGTGGA GCAGAGGTTA ATGCAACAGG AAGCGCTATA CAAATGCCAG 250

TTGGAGATGA CGTTTACGGA CGTTTATTTA ACGTAATTGG AGACGCTATT 300 

GATGGTCTTG GGAATTTACC AAAATCTGGT AAAGATGGTT TGCCAATACA 350 

CAGAGAGGCA CCAAAATTTG AAGACTTATC TACTTCTACA GAAGTATTAT 400

TTACAGGTAT TAAAGTAATT GACCTTATTG AGCCTTATGC AAAAGGTGGT 450 

AAGATTGGTT TATTTGGAGG TGCCGGAGTA GGTAAAACAG TATTAATTCA 500

GGAATTAATT AACAACATTG CAAAAGGTCA CGGTGGACTT TCTGTATTTG 550 

CTGGTGTAGG TGAGCGTACT CGTGAGGGTA ACGATTTACT ACGTGAGATG 600 

TTAGAGTCTG GTATTATTAA ATACGGAGAT GACTTTATGC ACTCTATGGA 650 

AGAAGGTGGT TGGGATTTAT CTAAAGTTGA TAAATCTGTA ATGAAAGATT 700 

CTAAAGCAAC CTTTGTATTT GGACAAATGA ATGAGCCACC AGGAGCACGT 750

GCACGTGTTG CATTATCTGG TTTAACTATT GCAGAATATT TCCGTGATGG 800 

AGCAGGTGAA GGTCAAGGTA AAGATGTACT TTTCTTTGTG GATAACATTT 850 

TCCGTTTTAC ACAAGCTGGT TCTGAGGTAT CTGCATTACT TGGTCGTATG 900 

CCATCTGCGG TAGGTTACCA ACCAACATTA GCAACAGAAA TGGGTGCTAT 950 

GCAAGAGCGT ATTACATCAA CAAAAAGAGG TTCTATTACA TCTGTACAGG 1000

CGGTTTACGT ACCTGCGGAT GATTTAACGG ATCCAGCACC GGCAACTACC 1050

TTTGCTCACT TAGATGCAAC AACGGTATTG TCTCGTAAAA TTGCAGAGTT 1100

AGGTATTTAC CCAGCGGTAG ATCCATTAGA TTCTACTTCT AGAATCTTAG 1150

CTCCAGAAAT TTTAGGAAAA GATCACTACT CTTGTGCACA ACGTGTAAAA 1200

GAGTTGTTAC AACGTTATAA AGAATTACAA GATATTATTG CTATCCTTGG 1250

TATGGAAGAA TTATCTGAGG AAGATAAAAT GGCAGTTGGT AGAGCAAGAC 1300

GTGTACAACG TTTCTTATCT CAGCCTTTCC ACGTAGCAGA GCAATTTACA 1350

GGTCTTAAAG GTGTTTTAGT AGATATCAAG GATACTATTA AAGGATTTAA 1400

TATGATTATG GATGGTGAAT TAGATCACTT ACCAGAATCT GCATTTAACC 1450

TTAAAGGTAC TATTGAAGAA GCTATAGAAG CAGGAGAAAA AATGCTTGCT 1500

GAAGCATAA 1509



2) INFORMATION FOR SEQ ID NO: 673

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ehrlichia risticii 

(B) STRAIN: HRC-IL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673 

CCTAAAATAT ATGATGCATT ATATGTAAAA CTAGATAATG AAAATTTGTG 50 

TTTAGAAGTT TCACAAATTA TTGGAGATAA TGTTGTTAGA TGTATTGCAA 100

TGGGAGCTAC TTATGGATTA AATCGTGGTT TAGAAGTAGT TTGTTCAGGA 150

AATCCAATTC AGGTTCCTGT AGGTGAACAA GTTTTAGGTA GAATGTTTAA 200

TGTTGTTGGT AAAACAATTG ACAATCTTGA ATCTTTAGAT GATAAAAATA 250 

TAAAAATGAT GCCAATTCAT CGAAATCCAC CATCATTTGA AGAGCAATCC 300 

AATGAAATTG AAATTTTTGA AACAGGCATT AAAGTTATTG ATTTATTAAT 350 

TCCATATGCT AAAGGTGGTA AGATTGGATT ATTTGGTGGA GCAGGGGTTG 400

GGAAAACGGT TCTTGTTCAA GAATTAATTC ACAATATCGC AAAAGGTCAT 450 

GGTGGTCTAT CTGTTTTTGC TGGAGTTGGT GAAAGAACTC GTGAAGGTAA 500 

TGACTTGTAT TATGAAATGA TTGAAGGTGG AGTTATAGAT AAAACAGCCT 550 

TAGTGTTTGG GCAAATGAAT GAACCTCCTG GCGCAAGAAT GCGCGTAGCA 600 

TTAACTGCTT TAACAATGGC TGAATATTTC CGTGATGTTC AAAACCAAGA 650 

TGTTTTGTTA TTTATTGATA ATATCTTTAG ATTTACACAA GCTGGTAGTG 700 

AAGTTTCAGC ATTATTAGGA AGAATGCCAT CTGCTGTTGG TTATCAACCA 750 

ACTTTGGCAT ATGAAATGGG ATTGTTACAA GAAAGAATCA CTTCCACTAA 800 

AAGTGGTTCT ATAACATCT 819 



2) INFORMATION FOR SEQ ID NO: 674 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio cholerae 

(B) STRAIN: ATCC 25870 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674

AGAGCGAAGT ACCAAGTGTT TACGATGCTC TGAATGTTGT GGATTCCAAA 50

GAACGTCTGG TTCTGGAAGT TCAACAGCAG CTAGGCGGTG GCGTGATTCG 100

CGCTATCGTT ATGGGTAGCT CGGATGGTTT ACGTCGTGGA ATGACAGTAC 150

AAAACACTGG CGCTCCAATT TCAGTACCAG TAGGTACTAA AACCCTAGGT 200

CGTATCATGA ACGTGCTTGG TGATGCGATT GACGAACGTG GCGACATTGG 250

CGCAGAAGAG GTGTACTCGA TTCACCGTCC TGCTCCAAGC TACGAAGAAC 300

AGTCTAGTGC AACTGAACTT TTGGAAACGG GTGTTAAGGT TATCGACCTG 350

ATCTGTCCGT TTGCGAAAGG CGGTAAAATC GGTCTGTTCG GTGGTGCGGG 400

TGTAGGTAAG ACCGTTAACA TGATGGAACT TATCAACAAC ATCGCGCTAC 450

AGCACTCAGG TTTGTCAGTA TTTGCTGGGG TAGGTGAGCG TACTCGTGAG 500

GGTAACGACT TCTACCACGA AATGCAGGAA GCGGGCGTTG TAAACGTTGA 550

ACAACCAGAA CTGTCGAAAG TAGCGATGGT TTACGGTCAG ATGAACGAGC 600

CACCAGGCAA CCGTCTGCGT GTAGCACTGA CTGGTCTGAC TATGGCGGAA 650

AAGTTCCGTG ATGAAGGCCG TGACGTACTG CTGTTTATCG ACAACATCTA 700

CCGTTACACC CTAGCGGGAA CGGAAGTATC TGCTCTGCTT GGCCGTATGC 750

CTTCAGCGGT AGGTTACCAA CCAACACTGG CTGAAGAGAT GGGTGTTCTG 800

CAAGAACGTA TCACGTCAAC CAAAAAAGGT TCTATCACCT 840



2) INFORMATION FOR SEQ ID NO: 675 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio cholerae 

(B) STRAIN: ATCC 25870 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675 

CGGCGGTATC CTAGTTGTAG CGGCAACTGA CGGTCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGCTGGGT CGCCAAGTAG GTATTCCTTA CATCATCGTA 100 

TTCATGAACA AGTGTGACAT GGTTGACGAT GAAGAGCTTC TAGAGCTGGT 150 

AGAGATGGAA GTTCGTGAGC TGCTGTCTGA GTACGATTTC CCAGGTGATG 200 

ACCTGCCAGT AATCCAAGGT TCAGCACTAG GCGCGCTAAA CGGCGAAGCA 250 

CAGTGGGAAG CGAAGATTGT TGAGCTAGCA GAAGCACTGG ATACTTAYAT 300 

TCCAGAGCCA GAGCGTGCAG TAGACATGGC ATTCCTGATG CCAATCGAAG 350 

ACGTATTCTC AATCCAAGGT CGTGGTACAG TAGTAACTGG CCGTATCGAG 400 

CGCGGCATCC TGAAAGTGGG TGACGAAGTA GCGATCGTTG GTATCAAAGA 450 

GACAGTAAAA ACGACCTGTA CAGGTGTAGA GATGTTCCGT AAGCTGCTTG 500 

ACGAAGGTCG TGCAGGTGAG AACGTAGGTG CACTRCTACG TGGTACTAAG 550 

CGTGAAGAAG TAGAGCGTGG TCAAGTACTG GCGAAGCCAG GTTCAATCAC 600 

ACCACACACT AAGTTCGAAT CAGAAGTATA CGTACTGTCA AAAGATGAAG 650 

GTGGCCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACAGG CAGCATTGAG CTACCAGARG GCGTAGAAAT 750 

GGTAATGCCA GGCGACAACG TGAAGATGGT TGTAGACCTG ATTGCACCAA 800 

TCGCGATGGA CGAAGGTCTA CGCTTCGC 828 



2) INFORMATION FOR SEQ ID NO: 676 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1298 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania enriettii 

(B) STRAIN: ATCC 50120

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 676 

CTTCTCGGAT GGCGTGCCGC CGGTGCTGAC GGCGCTGGAC GTGACGGAGG 50 

AACTCGGGCG CGACGAGCCG CTGACGCTAG AGATCGTGCA ACACTTGGAC 100 

GCGCACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TACTGAAGCT 150 

GAAGTCGAAG GTCGTGTCGA CCGGCGGGAA CATCTCTGTG CCTGTGGGCC 200 

GCGAGACGCT TGGCCGCATC TTCAACGTAC TGGGCGACGC GATTGACCAG 250

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CGGAGGCGCC 300 

GAAGCTGGCG GACCAGGCGG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AAGTGATCGA TCTGATCCTG CCGTACTGCA AGGGCGGCAA GATCGGTCTG 400

TTCGGCGGTG CCGGTGTGGG GAAGACTGTG ATCATCATGG AGCTGATCAA 450 

CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTCGCC GGCGTTGGGG 500 

AGCGCACCCG CGAGGGGACG GATCTGTACC TGGAGATGAT GCAATCGAAG 550 

GTGATCGACC TGAAAGGTGA GTCGAAGTGT GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCCGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CAGCGCTGCT 750 

GGGCCGCATC CCCGCCGCTG TGGGCTACCA ACCGACGCTT GCGGAGGATC 800

TCGGCATGTT GCAGGAGCGC ATCACGTCGA CGACGAAGGG GTCGATCACG 850 

TCTGTGCAGG CGGTGTACGT GCCTGCGGAT GATATCACGG ATCCCGCGCC 900 

GGCGACGACA TTCTCGCACC TGGACGCGAC GACGGTGCTG GACCGCGCGG 950 

TGGCGGAGTC TGGCATCTAC CCCGCGGTCA ACCCACTGGA GTGCGCGTCG 1000

CGTATCATGG ACCCTGATGT GATCGACGTG GATCACTACA ACGTTGCGCA 1050 

GGACATTGTT CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG TATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGTGCGCGCA AGGTGACCCG GTTCCTGTCA CAGCCGTTCC AGGTTGCGGA 1200

GGTGTTTACT GGCATGACAG GCCATTACGT ACAGCTGAGC GACACGGTGG 1250

AGTCGTTCTC CGGTTTGCTG ATGGGGTCGT ACGACCAGAT TCCGGAGA 1298 



2) INFORMATION FOR SEQ ID NO: 677 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Babesia microti

(B) STRAIN: Persing-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 677

CAAGCTCAAG TCTGAGCGTG AGAGAGGTAT TACTATTGAC ATTACTCTCT 50

GGAAATTTGA GACCCAGAAA TACGAGTACA CTGTCATAGA CGCACCTGGT 100

CATCGTGACT TTATCAAAAA TATGATTACT GGGACTTCAC AAGCCGACGT 150

TGCTATGCTT GTCGTTCCTG CTGAATCTGG CGGATTCGAG GCTGCTTTTT 200

CCAAAGAAGG TCAGACCCGT GAACACGCCT TACTAGCCTT CACACTTGGC 250

GTCAAACAGA TGATTGTTGC TATTAACAAA ATGGATTCTT GTCAGTACAA 300

GGAGGATCGT TATATGGAAA TTTTCAAGGA AGTACAGCAG TACTTGAAGA 350

AGGTGGGTTA CAAAGTTGAA AGCGTGCCGT TTGTTGCTAT TTCAGGATTC 400

CACGGTGACA ACATGGTTGA AAAATCTACT AACATGCCTT GGTATAAGGG 450

TAAGACCCTC GTAGAGGCAC TTGATCAAAT GGAGCCTCCA AAACGTCCGG 500

TCGAAAAACC TCTTAGATTG CCCCTGCAGT CAGTCTATAA AATTGGAGGT 550

ATTGGTACGG TACCAGTCGG AAGGGTCGAA ACAGGACAAC TGAAAGCAGG 600

AATGATCATT ACTTTTGCCC CCACTGGTTT GACCACTGAA TGTAAATCTG 650

TTGAAATGCA TCACGAGGTT GTGGAAGTGG CTAGCCCCGG TGATAACGTT 700

GGATTTAATG TCAAGAATGT GTCTGTTAAG GATATTAAGA GAGGAAATGT 750

GGCTTCGGAT TCGAAAAATG ACCCAGCCAA GGAAGCTACC TCTTTCTCTG 800

CACAAGTCAT TGTACTCAAT CACCCTGGTA CCATCAAGGC CGGTTACTCA 850

CCTGTGGTTG ATTGCCATAC TGCCCACATT GCTTGCAAAT TCGAATCTCT 900

AGACACTAGG ATTGACAAGC GTACTGGCAA GACTTTGGAA GAAAATCCTA 950

AGACTATTAA GAATGGTGAC GCTGCCATGG TGACTATGAA ACCAAATAAA 1000

CCCATGGTTG TGGAAACTTT CACCGACTAC GCCCCGTTGG GCCGGTTCGC 1050

CGTGCGTGAT ATGCGCCAAA CCGTTGCCGT CGG 1083



2) INFORMATION FOR SEQ ID NO: 678 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Cryptococcus neoformans

(B) STRAIN: Lev- 12 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 678 

TGTGCTCATT CAGGAGCTCA TTAACAACAT TGCCAAGGCT CACGGTGGTT 50

ACTCTGTCTT CACTGGTGTC GGTGAGCGAA CTCGTGAGGG TAACGACTTG 100 

TACCACGAAA TGAGGGAGAC TGGTGTTATC AACCTTGAGG GTGACTCCAA 150 

GGTCGCTCTT GGTGAGTTCT TTTTTTTCTT CAGGCTAATT AGTCGATGAC 200

GTGGGCCCTG ACTAAAACTG TTTCTTCCAG TCTTCGGTCA GATGAACGAG 250

CCCCCTGGAG CCCGTGCCCG AGTTGCCCTT ACTGGTTTGA CCATTGCCGA 300

GTACTTCCGT GACGAGGAAG GCCAGGATGT GTTGCTTTTC ATTGACAACA 350 

TTTTCCGATT CACCCAGGCC GGTTCCGAGG TGTCTGCCTT GCTCGGTCGT 400

ATCCCCTCTG CCGTCGGTTA CCAGCCCACT CTTTCCACCG ACATGGGTGG 450

TATGCAGGAG CGAATGTAGG TTGCATTCTC TGTGATTTTA CGGCAAGCCT 500 

TGACTTTTTT TTTCTAGTAC CACCACCAAG AAGGGTTCCA TTACCTCCGT 550

C 551 



2) INFORMATION FOR SEQ ID NO: 679

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 552 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: ATCC 44104 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 679

TGTGCTCATT CAGGAGCTCA TTAACAACAT TGCCAAGGCC CACGGTGGTT 50

ACTCCGTCTT CACCGGTGTC GGTGAGCGAA CTCGTGAGGG TAACGACTTG 100

TACCACGAAA TGAGGGAGAC TGGTGTTATC AACCTTGAGG GTGACTCCAA 150

GGTCGCTCTT GGTGAGTTCT TTTTTTTGGG AGCTAATTAG TCGATGACGT 200

GGGCCCTGAC CAAAACCGTT TCTTTCAGTC TTCGGTCAGA TGAACGAGCC 250

CCCTGGAGCC CGTGCCCGAG TTGCTCTTAC TGGTTTGACC ATTGCCGAGT 300

ACTTCCGTGA CGAGGAAGGC CAGGATGTGT TGCTTTTCAT TGACAACATT 350

TTCCGATTCA CCCAGGCCGG TTCCGAGGTG TCTGCCTTGC TCGGTCGTAT 400

CCCCTCTGCC GTCGGTTACC AGCCCACTCT TTCCACCGAT ATGGGTGGTA 450

TGCAAGAGCG AATGTAAGTT GCATTTTTTG TGATTTTACG GCAAGTCTTG 500

ACTTACATTT TTTTCTAGCA CCACCACCAA GAAGGGTTCC ATTACCTCCG 550

TC 552



2) INFORMATION FOR SEQ ID NO: 680

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1018 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Cunninghamella bertholletiae

(B) STRAIN: ATCC 42115

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 680

TCTCCCTGCT ATTTTAAACG CTCTTGAAGT TAAGGATCAT GCTGGTGGTC 50

GTCTTGTTCT TGAAGTTGCT CAACACTTGG GTGAAAACAC TGTACGTACT 100

ATTGCTATGG ATGGTACTGA AGGTAAGTTT ATTTTTAGAT GATCATAAAT 150

AATTGATCAT AATGATAAAA AAAAAAGAAG AAGAAGAACA GGATGTATAT 200

AATGGTTAAT AAATAATATT TTCATATTGK ATATAACTAT TTAATCTGTT 250

TTTTTTTCTT CATGATTATA TATATATATG TMCTAATATC TAATATGAAC 300

CTTTTTTATA AAATTAATCA GGTCTTGTCC GTGGTCAAAA GGTTGTTGAT 350

ACTGGTGCTC CTATTACCAT TCCTGTTGGT AAGGAAGTTC TTGGTCGTAT 400

CATCAACGTT ATTGGTGAAC CCATTGATGA ACGTGGTCCT ATTAAGGCCA 450

AGTCTCACCG TGCTATTCAC GCTGAAGCTC CAGAATTCGT TGATCAATCT 500

CCTACTCCCG AAATTCTTGA AACTGGTATT AAGGTTGTCG ATTTATTAGC 550

TCCTTATGCT CGTGGTGGTA AGATTGGTCT TTTCGGTGGT GCTGGTGTAG 600

GTAAAACTGT CTTGATTCAA GAACTTATTA ACAACATTGC TAAAGCCCAT 650

GGTGGTTACT CTATTTTCTG TGGTGTTGGT GAACGTACTC GTGAAGGTAA 700

CGATTTATAC CACGAAATGA TGGAAACTGG TGTCATTAAA CTTGXAGGTG 750

ACTCCAAGTG TGCTCTTGTA TTCGGTCAAA TGAACGAACC TCCTGGTGCT 800 

CGTGCCCGTG TTGCTTTAAC TGGTTTAACC ATTGCTGAAT ACTTCCGTGA 850 

TGAAGAAGGT CAAGATGTGT TACTTTTCAT TGATAACATT TTCCGTTTCA 900 

CTCAAGCTGG TTCTGAAGTA TCTGCCCTTT TAGGTCGTAT TCCATCTGCT 950 

GTAGGTTACC AACCCACTTT ATCTACTGAT ATGGGTGGTA TGCAAGAACG 1000 

TATTACTACT ACCAAGAA 1018 



2) INFORMATION FOR SEQ ID NO: 681 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 bases
(B) TYPE: Nucleic acid

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681 
GGISSITTYG GIISIGGIAA RAC 23 

2) INFORMATION FOR SEQ ID NO: 682 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 682 
GTIACIGGYT CYTCRAARTT ICCICC 26 

2) INFORMATION FOR SEQ ID NO: 683 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 683 
GTIACIGGIT CISWIAWRTC ICCICC 26



2) INFORMATION FOR SEQ ID NO: 684 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3267 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 
(C) ACCESSION NUMBER: M64984 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684 



ATGGCTGGTG CTTTAGAAAA CGCAAGAAAA GAAATTAAAC GTCTTTCATT 50

AGATGACACC AATGAATCCC AATATGGTCA AATCTATTCT GTTTCCGGTC 100

CGGTTGTTAT TGCCGAAAAC ATGATTGGAT GTGCCATGTA CGAATTGGTT 150

AAAGTTGGTC ATGATAATTT AGTTGGGGAA GTTATTAGAA TTAATGGTGA 200

TAAAGCAACC ATTCAAGTTT ATGAAGAAAC TGCAGGGGTC ACTGTTGGTG 250

ATCCAGTTTT AAGAACTGGT AAACCATTAT CTGTTGAATT AGGTCCTGGT 300

TTAATGGAAA CTATTTATGA TGGTATTCAA AGACCTTTAA AAGCCATTAA 350

AGATGAATCC CAATCTATTT ATATCCCAAG AGGTATTGAT GTTCCTGCTT 400

TATCAAGAAC TGTTCAATAT GATTTCACTC CAGGTCAATT GAAAGTTGGT 450

GATCATATCA CTGGTGGGGA CATTTTTGGT TCTATTTATG AAAACTCTTT 500

ATTGGATGAC CATAAGATTT TGTTACCTCC AAGAGCAAGA GGTACTATTA 550

CTTCTATTGC TGAAGCCGGT TCTTATAATG TTGAAGAACC AGTTTTGGAA 600

GTTGAATTTG ATGGTAAGAA ACATAAATAC TCTATGATGC ATACATGGCC 650

AGTTAGAGTT CCAAGACCAG TTGCTGAAAA ATTGACTGCT GATCATCCAT 700

TGTTGACCGG TCAAAGAGTC TTGGATTCTT TATTCCCATG TGTTCAAGGT 750

GGTACTACTT GTATCCCAGG GGCTTTTGGT TGTGGTAAAA CTGTTATTTC 800

TCAATCTTTG TCCAAATTCT CCAACTCTGA TGTTATTATC TATGTTGGTT 850

GTTTCACTAA AGGTACTCAA GTCATGATGG CTGATGGTGC CGACAAATCT 900

ATTGAATCTA TTGAAGTTGG TGACAAAGTC ATGGGTAAAG ATGGTATGCC 950

AAGAGAAGTT GTTGGCTTAC CAAGAGGTTA TGATGATATG TACAAGGTTC 1000

GTCAACTTTC TAGTACTAGA CGTAATGCTA AATCCGAAGG CTTGATGGAT 1050

TTCACTGTTT CTGCTGATCA TAAACTTATC TTGAAAACTA AACAAGATGT 1100

CAAGATTGCT ACACGTAAAA TTGGTGGCAA CACCTATACT GGTGTTACTT 1150

TCTATGTTTT GGAAAAGACT AAGACTGGTA TTGAATTAGT TAAAGCCAAG 1200

ACTAAAGTTT TCGGTCATCA TATCCATGGT CAAAATGGCG CTGAAGAAAA 1250

AGCTGCTACT TTTGCTGCTG GCATTGACTC TAAAGAATAC ATTGATTGGA 1300

TCATTGAAGC TAGAGATTAT GTACAAGTTG ATGAAATTGT CAAGACCAGC 1350

ACCACTCAAA TGATCAACCC AGTTCATTTT GAATCTGGTA AACTCGGTAA 1400

CTGGTTACAC GAACACAAGC AAAACAAATC ACTTGCTCCA CAATTGGGTT 1450

ACTTGTTGGG TACTTGGGCT GGTATTGGAA ATGTTAAATC TTCTGCTTTC 1500

ACCATGAACT CCAAAGATGA TGTTAAATTA GCTACAAGAA TTATGAACTA 1550

CTCTTCAAAA TTGGGCATGA CTTGTTCTTC TACTGAATCC GGTGAACTCA 1600

ATGTCGCTGA AAACGAAGAA GAATTTTTCA ATAACCTTGG TGCTGAAAAG 1650

GATGAAGCTG GTGATTTCAC TTTTGATGAA TTTACCGATG CTATGGATGA 1700

ATTGACTATC AATGTTCATG GTGCAGCTGC AAGCAAGAAG AACAATTTGT 1750

TGTGGAATGC TTTGAAATCT CTTGGTTTCA GAGCCAAGTC TACTGATATT 1800

GTCAAGAGTA TTCCTCAACA TATTGCTGTT GATGATATTG TTGTCAGAGA 1850

ATCTTTGATT GCCGGTTTAG TTGATGCTGC TGGTAATGTT GAAACCAAAT 1900

CCAATGGTTC TATTGAAGCT GTTGTTAGAA CTTCTTTCAG ACATGTCGCT 1950

AGAGGTCTTG TCAAGATTGC TCATTCTTTG GGTATTGAAT CATCTATTAA 2000

TATTAAAGAT ACTCACATTG ATGCTGCTGG TGTTAGACAA GAATTTGCTT 2050

GTATTGTCAA TTTGACTGGT GCTCCACTTG CTGGTGTTCT TTCTAAATGT 2100

GCACTTGCAA GAAACCAAAC TCCAGTTGTC AAATTTACCA GAGACCCAGT 2150

TTTGTTCAAC TTTGATTTGA TCAAATCTGC AAAAGAAAAC TATTATGGTA 2200

TTACTTTGGC TGAAGAAACT GATCATCAAT TCCTTTTATC CAACATGGCC 2250

TTGGTGCACA ACTGTGGTGA ACGTGGTAAT GAGATGGCTG AAGTTTTGAT 2300

GGAATTCCCA GAATTGTTTA CTGAAATTTC TGGTAGAAAA GAACCAATTA 2350

TGAAACGTAC CACTTTGGTT GCCAATACTT CTAATATGCC AGTCGCTGCC 2400

AGAGAAGCTT CTATTTATAC TGGTATTACA TTGGCTGAAT ATTTCAGAGA 2450

TCAAGGTAAG AATGTTTCTA TGATTGCTGA TTCTTCTTCA CGTTGGGCTG 2500

AAGCTTTGAG AGAAATTTCT GGTAGATTGG GTGAAATGCC TGCTGATCAA 2550

GGTTTCCCAG CTTATTTGGG TGCTAAATTG GCTTCTTTCT ATGAGCGTGC 2600

CGGTAAAGCC ACTGCTTTGG GTTCACCAGA TAGAGTTGGT TCAGTTTCTA 2650

TTGTTGCTGC TGTTTCTCCA GCTGGTGGTG ATTTCTCTGA TCCAGTTACT 2700

ACTTCTACTT TGGGTATTAC TCAAGTTTTC TGGGGGTTGG ATAAGAAATT 2750

GGCCCAAAGA AAACATTTCC CATCTATTAA CACCAGTGTT TCTTATTCTA 2800

AATACACCAA TGTTTTGAAC AAATACTATG ATTCCAACTA TCCAGAATTC 2850

CCACAATTGA GAGACAAAAT TAGAGAAATT TTATCTAATG CTGAAGAATT 2900

GGAACAAGTT GTTCAATTAG TTGGTAAATC TGCATTGTCT GATTCTGATA 2950

AGATTACTTT AGATGTTGCT ACCTTGATTA AAGAAGATTT CTTGCAACAA 3000

AATGGTTATT CTTCATATGA TGCATTCTGT CCAATTTGGA AGACTTTTGA 3050

TATGATGAGA GCATTTATTT CATATTATGA TGAAGCACAA AAAGCAATTG 3100

CCAATGGTGC TCAATGGTCT AAATTAGCTG AAAGTACTAG TGATGTTAAA 3150

CATGCTGTTT CTTCAGCTAA ATTCTTTGAA CCATCAAGAG GTCAAAAAGA 3200

AGGTGAAAAA GAATTTGGAG ATTTATTAAC CACTATCTCC GAAAGATTTG 3250

CTGAAGCTTC AGAATAA 3267



2) INFORMATION FOR SEQ ID NO: 685 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1782 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 






(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus hirae 

(B) STRAIN: ATCC 9790 

(C) ACCESSION NUMBER: D17462 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685 



TTGCAAATTG GAAAAATCAT AAAAGTCTCC GGTCCTCTCG TTATGGCAGA 50

AAATATGTCA GAAGCAAGTA TTCAAGACAT GTGTTTAGTG GGAGATTTAG 100

GAGTCATCGG CGAAATCATT GAGATGCGTC AAGATGTGGC GTCTATTCAA 150

GTATATGAAG AAACTTCAGG AATTGGTCCC GGAGAACCTG TTCGTTCCAC 200

TGGGGAAGCA CTATCTGTTG AGCTAGGACC AGGAATCATT TCACAAATGT 250

TTGACGGGAT TCAAAGACCA CTGGATACAT TTATGGAAGT GACTCAAAGT 300

AACTTCTTAG GACGTGGGGT CCAATTACCA GCTTTAGATC ATGAGAAACA 350

ATGGTGGTTT GAAGCGACAA TCGAAGAAGG AACAGAAGTA AGTGCTGGAG 400

ACATCATTGG GTACGTGGAT GAAACGAAGA TCATTCAGCA CAAAATCATG 450

GTCCCTAATG GTATCAAAGG AACTGTACAA AAAATTGAAT CTGGATCATT 500

TACGATCGAT GATCCGATTT GTGTGATCGA AACGGAACAA GGCTTAAAAG 550

AGCTGACGAT GATGCAAAAA TGGCCAGTAC GTCGTGGTCG ACCAATCAAA 600

CAAAAATTAA ATCCAGATGT ACCGATGATC ACCGGTCAAA GGGTCATTGA 650

CACGTTTTTC CCAGTAACTA AAGGAGGAGC GGCAGCCGTT CCAGGTCCGT 700

TTGGTGCAGG GAAGACAGTT GTGCAACACC AGATTGCTAA GTGGTCGGAC 750

GTAGATCTAG TGGTTTACGT TGGTTGTGGG GAACGAGGAA ATGAAATGAC 800

GGATGTCGTC AATGAATTTC CTGAACTGAT CGATCCAAAT ACAGGCGAGT 850

CTTTGATGGA ACGAACTGTG TTGATCGCTA ATACATCGAA CATGCCAGTA 900

GCTGCTCGAG AAGCTTCTAT TTATACGGGA ATCACGATTG CCGAGTACTT 950

CCGTGACATG GGGTATGATG TAGCAATCAT GGCAGATTCC ACTTCTCGTT 1000

GGGCAGAAGC ACTGCGTGAA ATGAGCGGAC GTTTAGAAGA AATGCSTGGT 1050

GATGAAGGTT ATCCCGCTTA TCTGGGCTCT CGTCTAGCTG AATACTATGA 1100

ACGTTCAGGA CGTGTCATTG CTCTAGGCTC TGACCAACGT GAGGGCAGTA 1150

TCACTGCCAT CAGTGCGGTT TCTCCTTCTG GTGGAGATAT CTCTGAACCA 1200

GTGACTCAAA ATACCTTACG TGTGGTGAAG GTTTTCTGGG GATTAGATTC 1250

TAGTCTTGCT CAAAAAAGAC ATTTTCCATC GATTAACTGG ATCCAAAGTT 1300

ACTCATTATA TTCAACAGAA GTTGGCAGAT ATATGGATCA AATCTTACAA 1350

CAGGATTGGT CTGATATGGT AACTGAAGGC ATGCGGATCT TGCAAGAAGA 1400

AGAACAATTA AATGAAATCG TGCGCTTGGT AGGGATCGAT TCGCTTTCTG 1450

ATAACGATCG CTTGACCCTT GAAGTAGCAA AATCGATTCG AGAAGACTAT 1500

TTACAACAAA ATGCTTTTGA TGATGTAGAT ACGTTTACTT CAAGAGAAAA 1550

ACAATTCAAC ATGTTGAAAG TTATTTTGAC TTTTGGGAAA GAAGCTCGAA 1600

AAGCCTTATC GTTGGGAGCG TACTTCAATG AAATCATGGA AGGTACAGTA 1650

GCGGTCAGAG AACGCATTAG TCGGAGCAAG TATATTCCAG AAGAAGAGTT 1700

AGCCAAAATT AGTAGTATAA ATGAAGAAAT CAAAGAAACG ATCCAATTGA 1750

TTGTTTCAGA AGGAGGGATG ACCGATGATT AA 1782



2) INFORMATION FOR SEQ ID NO: 686

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1781 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Chlamydia pneumoniae 

(B) STRAIN: CWL 029 

(C) ACCESSION NUMBER: Genome project 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 686

CAGGGACATG TTATAGAAGC TTATGGAAAC TTGTTACGTG TACGCTTTGA 50 

CGGATATGTT AGACAAGGTG AAGTTGCATA TGTCAACGTA GATAATACCT 100 

GGTTAAAAGC AGAAGTGATT GAAGTTGCTG ATCAAGAAGT CAAGGTTCAG 150 

GTATTTGAAG ATACACAAGG CGCGTGTCGA GGAGCTCTTG TTACGTTTTC 200

AGGACATCTT TTAGAAGCCG AGTTAGGGCC TGGCTTGCTT CAGGGCATTT 250 

TCGATGGACT TCAAAATCGT CTTGAGGTGC TAGCTGAAGA TAGTTCTTTC 300

TTGCAGAGAG GCAAGCATGT TAATGCTATT TCTGATCATA ATTTATGGAA 350

TTATACTCCC GTAGCTTCTG TTGGGGATAC TTTAAGACGA GGAGATCTTC 400

TAGGAACAGT ACCTGAAGGA CGATTTACTC ATAAGATTAT GGTTCCTTTT 450

TCTTGCTTTC AAGAGGTTAC CCTGACTTGG GTAATTTCTG AAGGAACCTA 500 

TAATGCTCAT ACTGTGGTCG CAAAAGCTCG AGATGCTCAG GGTAAAGAAT 550 

GTGCCTTTAC TATGGTGCAA AGATGGCCGA TCAAACAAGC TTTTATTGAA 600 

GGAGAGAAGA TCCCTGCGCA TAAGATTATG GATGTGGGTT TGCGAATCTT 650 

AGATACGCAA ATTCCAGTAT TGAAGGGGGG AACTTTCTGT ACCCCAGGAC 700

CTTTTGGTGC AGGGAAAACA GTCTTACAAC ACCATCTTTC TAAGTACGCT 750 

GCTGTAGATA TTGTGATTTT GTGTGCGTGC GGAGAGCGTG CTGGTGAAGT 800 

TGTTGAGGTA TTACAAGAGT TCCCTCATCT TATCGACCCC CATACCGGAA 850

AGTCTTTAAT GCACAGAACA TGTATTATTT GTAACACATC ATCCATGCCT 900 

GTGGCTGCCC GAGAGTCTTC GATCTATTTA GGAGTGACGA TTGCAGAATA 950

CTATCGCCAG ATGGGACTAG ATATTCTGCT TTTAGCTGAT TCTACATCCC 1000 

GATGGGCACA AGCCCTTAGA GAGATTTCGG GACGTCTTGA AGAAATCCCT 1050 

GGAGAGGAAG CATTTCCTGC ATACCTGTCT TCTAGAATAG CTGCTTTTTA 1100 

TGAGCGAGGA GGAGCTATCA CCACGAAAGA TGGTTCTGAA GGATCTTTAA 1150 

CTATATGTGG TGCGGTGTCT CCTGCAGGAG GAAACTTTGA AGAACCAGTC 1200

ACTCAATCTA CATTAGCTGT AGTCGGAGCG TTCTGTGGTC TTTCAAAAGC 1250

ACGAGCTGAC GCACGTAGGT ATCCTTCAAT AGACCCTTTG ATTTCTTGGT 1300

CAAAATATTT GAACCAGGTA GGACAAATTT TAGAAGAGAA GGTTTCAGGC 1350

TGGGGTGGTG CTGTGAAAAA AGCAGCACAG TTTCTAGAGA AAGGTTCAGA 1400

AATCGGCAAG CGTATGGAAG TTGTCAGTGA AGAAGGGGTT TCTATGGAAG 1450

ACATGGAAAT CTACTTAAAG GCAGAACTTT ATGATTTTTG TkATCTCCAG 1500

CAGAACGCAT TCGATCCTGT GGACTGTTAT TGTCCTTTTG AGAGACAGAT 1550

AGAGTTATTT TCATTAATCA GTCGTATTTT TGATGCTAAA TTTGTTTTTG 1600

ATAGTCCTGA TGATGCAAGA AGCTTTTTCC TTGAGCTGCA GAGCAAGATT 1650

AAGACATTAA ATGGCCTGAA ATTTCTTTCA GAGGAATATC ATGAGAGTAA 1700

AGAGGTCATA GTTAGACTGT TGGAAAAAAC AATGGTACAA ATGGCGTAAG 1750

GATATGCAAA CAATCTACAC AAAAATAACT G 1781






2) INFORMATION FOR SEQ ID NO: 687 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1758 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Halobacterium salinarum 
(C) ACCESSION NUMBER: S56356 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687

ATGAGTCAAG CTGAAGCAAT CACTGACACC GGCGAAATCG AGAGCGTGAG 50 

CGGTCCGGTC GTGACCGCCA CGGGCCTCGA CGCGCAGATG AACGACGTCG 100 

TCTACGTGGG CGACGAGGGT CTGATGGGCG AGGTCATCGA GATCGAAGGC 150 

GACGTAACCA CCATCCAGGT CTACGAGGAG ACGTCCGGCA TCGGGCCGGG 200

CCAGCCCGTC GACAACACGG GCGAACCGCT CACCGTGGAC CTGGGCCCGG 250

GGATGCTGGA CTCCATCTAC GATGGTGTGC AGCGTCCACT GGACGTCCTC 300 

GAAGACGAGA TGGGGGCGTT CCTCGATCGC GGTGTTGACG CACCCGGCAT 350 

CGACCTCGAC ACCGACTGGG AGTTCGAGCC CACCGTCGAG GCGGGCGACG 400

AGGTCGCGGC CGGCGATGTC GTCGGCACCG TCGACGAAAC GGTCAGCATC 450

GAACACAAGG TTCTGGTGCC CCCCCGCAGT GACGGCGGCG AAGTCGTCGC 500 

CGTCGAATCC GGCACGTTCA CCGTCGACGA CACGGTCGTC GAGTTGGACA 550 

CCGGCGAGGA GATCCAGATG CACCAGGAGT GGCCGGTCCG CCGCCAGCGC 600 

CCCACCGTCG ACAAGCAGAC GCCGACGGAG CCGCTGGTGT CCGGCCAGCG 650 

CATCCTCGAC GGCCTGTTCC CGATCGCGAA AGGCGGGACG GCCGCGATCC 700

CGGGGCCGTT CGGGTCCGGG AAGACGGTCA CCCAGCAGTC CCTCGCGAAG 750 

TTCGCCGACG CGGACATCGT TGTCTACATC GGCTGTGGTG AGCGCGGCAA 800 

CGAGATGACG GAAGTCATCG AGGACTTCCC GGAGCTGCCC GACCCCCAGA 850 

CCGGGAACCC GCTGATGGCC CGCACCACGC TCATCGCGAA CACGTCGAAC 900

ATGCCGGTTG CCGGGCGTGA GTCCTGCATT TACACGGGAA TCACCATCGC 950

GGAGTACTAC CGCGACATGG GCTACGACGT GGCGCTGATG GCCGACTCCA 1000 

CCTCGCGGTG GGCGGAGGCC ATGCGGGAGA TCTCCTCGCG ACTCGAGGAG 1050 

ATGCCCGGCG AGGAGGGGTA TCCCGCGTAC CTGGCCGCCC GCCTCTCGGA 1100 

GTTCTACGAG CGCGCCGGCT ACTTCGAGAA CTTCAACGGG ACCGAGGGCT 1150 

CCATCTCGGT CATCGGTGCG GTGTCGCCGC CCGGCGGGGA CTTCTCCGAG 1200

CCGGTCACCC AGAACACGCT GCGCATCGTG AAGACGTTCT GGGCGCTTGA 1250 

CTCGGACCTC GCCGAGCGCC GGCACTTCCC GGCGATCAAC TGGGACGAGT 1300 

CCTACAGCCT CTACAAGGAC CAACTCGACC CGTGGTTCAC GGACAACGTC 1350 

GTCGACGACT GGGCCGAGCA GCGCCAGTCG GCGGTCGACA TCCTCGACGA 1400

GGAATCCGAA CTCGAAGAGA TCGTGCAGCT CGTCGGGAAG GACGCGCTGC 1450

CCGAGGACCA GCAGCTCACG CTGGAAGTCG CGCGGTACAT CCGCGAGGCG 1500

TGGCTCCAGC AGAACGCGCT CCACGACGTG GATCGCTACT GCCCGCCCGA 1550

GAAGACGTAC GCCATCCTCT CCGGCATCAA GACGCTTCAC GAGGAGTCCT 1600

TTGAGGCGTT GGACGCCGGT GTGCCAGTCG AGGAGATCAC GTCGATCGAC 1650

GCCGCGCCGC GCCTGAATCG TCTCGGCACG ACGCCCGACG ACGAGCACGA 1700

GGCGGAGGTC GCGGAGATCA AACAGCAGAT TACCGAGCAG CTTCGGGAGC 1750

TCTACTGA 1758






2) INFORMATION FOR SEQ ID NO: 688 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3118 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human 

(C) ACCESSION NUMBER: L09234 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 688

GAATTCCGGC AGCTGACTAG TCTTGTGATT GGGGTCCTGG GCTGATAAAA 50 

TCATTCCAAA TGACGAGCAC ATTGATAAAG ACGTCCGATG AGGACCGGGA 100 

GTCCAAATTC GGCTTTGTTT TTGCCGTATC TGGACCTGTG GTGACAGCTG 150 

AACGAATGGC CGGTTCTGCT ATGTACGAAC TGGTGCGTGT CGGTTATTAT 200

GAACTGGTCG GAGAGATCAT CCGGTTGGAG GGTGACATGG CAACAATCCA 250 

AGTATACGAA GACACCTCAG GTGTGACAGT AGGCGATCCC GTGCTGCGCA 300 

CAGGCAAGCC GCTGTCCGTG GAACTGGGAC CCGGAATCAT GGGCAGCATC 350 

TTCGACGGTA TCCAGCGACC GCTGAAGGAT ATCAATGAAC TGTCAAATAG 400

TATCTACATC CCGAAAGGTG TCAATGTGCC TGCCCTGAGT CGCACTGCAC 450

AGTGGGACTT CAGTCCCGTC AGTGTCAAGG TTGGAAGCCA CATTACTGGT 500 

GGTGACCTGT ACGGTTTGGT CCACGAAAAT ACTCTGGTGA AACACAAGTT 550 

GCTGCTGCCG CCCCGTGCCA AGGGAACTGT CACGTACATT GCAGAACCTG 600 

GAAACTACAC AGTTGATGAT GTTGTCCTGG AGACAGAATT TGACGGCGAG 650 

CGATCAAAGT TCACCATGCT GCAAGTGTGG CCTGTACGTC AGCCCAGGCC 700

TGTTACAGAA AAGTTGCCAG CTAACTACCC CCTCCTTACT GGCCAGCGTG 750 

TGCTCGACTC CCTATTCCCG TGTGTCCAGG GTGGAACAAC AGCTATTCCT 800 

GGGGCCTTCG GATGTGGCAA GACTGTAATA TCACAGTCTT TGTCAAAATA 850 

CTCAAACTCC GATGTAATTA TCTATGTAGG TTGTGGTGAG CGAGGTAATG 900 

AAATGTCAGA AGTACTCAGG GATTTCCCGC AGTTGTCGTT GGAGATTGAT 950

GGTGTGACTG AATCAATCAT GAAGAGAACA GCCCTGGTCG CAAACACATC 1000 

AAACATGCCT GTGGCTGCTC GAGAAGCATC TATCTACACA GGTATTACAC 1050 

TGTCAGAATA CTTCAGGGAC ATGGGTTACA ATGTATCCAT GATGGCTGAC 1100 

TCAACTTCAC GATGGGCCGA AGCTCTTCGA GAAATCTCAG GTCGATTGGC 1150 

TGAAATGCCT GCCGACAGCG GTTATCCCGC CTACCTAGGT GCACGACTTG 1200

CCAGTTTCTA CGAGCGTGCC GGCCGTGTGA AGTGCTTGGG TAACCCAGAC 1250 

AGGGAGGGCT CCGTGAGTAT AGTGGGCGCC GTGTCGCCGC CCGGTGGAGA 1300 

CTTCTCAGAT CCCGTGACGA CGGCCACACT AGGTATCGTC CAGGTGTTCT 1350 

GGGGTCTCGA CAAGAAACTT GCCCAGCGAA AGCACTTCCC ATCCATCAAC 1400

TGGCTCATCT CGTACAGTAA ATACATGCGT GCTCTGGATG ACTTCTACGA 1450

CAAGAATTTC CCAGAGTTTG TCCCACTGCG TACAAAGGTG AAGGAGATTT 1500 

TGCAGGAGGA AGAAGACCTG TCTGAAATTG TGCAGTTGGT CGGTAAAGCT 1550 

TCATTGGCAG AAACTGACAA GATCACACTT GAGGTTGCCA AACTATTAAA 1600 

GGATGATTTC CTGCAACAGA ACAGCTATTC ACCATATGAC CGTTTCTGCC 1650 

CATTCTACAA GACAGTAGGA ATGCTGAAAA ATATGATTGC TTTCTACGAT 1700

ATGTCTCGGC ATGCAGTTGA ATCTACTGCT CAGAGCGAGA ACAAfiATCAC 1750

TTGGAATGTT ATTAGAGATT CTATGGGCAA TATTCTGTAT CAGCTTTCCT 1800

CCATGAAATT CAAGGATCCA GTCAAGGATG GAGAAGCGAA GATCAAGGCA 1850

GACTTTGAGC AGCTTCATGA AGACATTCAG CAAGCCTTCA GGAACCTGGA 1900

GGATTAAAGT GGTAGCTGCC AGTGGTTCTC TCGGTGCAGT TGTCACATTT 1950

GGCAAGCTCT GTAGGGTTGC CGAGTGGCAT CGGTGCTAGA CACCTGAGCA 2000

TTCCTTTGCC ACATAAAGAC TAAAGCAGGT GGAATTTCAG TTGTAAAAAG 2050

CTGGTTCCAT TGGTGCTAAG ATTATGTTGT GCCCTTTTCT GCTTCTCACA 2100

TTCCAACAGA GGAATTTACT TCCAGTTTTC TTCCATTTTC CTCCTCATTT 2150

TAAGTGTCGG TACAGAGGCA ATAATCTGAT AACTCTGTAC CGTCACTTAC 2200

AAGCAGGGAG AATTTGTAAT TATTACAAAT CCCATTATCT CTGTGCACCA 2250

CAGCCTTGTA AATTCATTTG TCCCAGGACT CCCTCTTGTG TGTACGTGAG 2300

ATTGCCGTCT GTATGTATGT ACACACCGTA CTGCAGTATT TGAAGTCAGT 2350

CAGAAGGTGA ATTACACCAC TTACTCATTG TGTCACGTAG CAAGTGTGCA 2400

AACTGCCATC CATTGTCCTA TTTATTCACA TAACTAGTTT TCTTTGCATT 2450

TCCAGTGTTG CAAATTGTGT TTAGAAAATT ATGCCATCGA GACTGGTCGA 2500

ACCTCACATT GTAACTCAGT ATTTACACAC ACGTTTACTT GCTACAGAAA 2550

TGTAGAAAAA ATAATTGTTG TATATTGAAA GTACAAGTGA CAAAGTTGCA 2600

TTTAAAATGG TGAATGTATT TTATATTTCT TTTGTAGACA CAAGAGTTAA 2650

TGCATTTTGC TTAATGGAGA TGTATGTAAA CCTAAAATAG CAGTTTGTGC 2700

ACAAATTATG TATATGTGAA ATGGAGATGG TTTCTAATTT GCTGATTGAT 2750

TGCCAGTATT AATTTAAACA ACTGTAGTTG TGGGATGTAG TGGGAAGATT 2800

TTTTTTTTCC TATAAAATTG GTGGATGTAT GTGTCGGAGA TTTTGATTGT 2850

ATGTGTAAAA TAGTGATCCC AGTAACTGTA AAGCTTTAGA ATACAGTTAC 2900

TGACTGTATA GTTGTACAGG TGTTGTTACT TTTAAGAATT TATTGACACA 2950

AAGGTGAAAG TCTATTATTG TATTGTAATG TTTAAAGCAT TTAAGGTTTA 3000

AAAATCCTAC TTCTGTGTAT AAATGTTACC ATTCCTCATA TAACATAACT 3050

GTGTAGAAAT ACAGTCAACT TCATGTTCAT TAGCATTTCA CTGTTGTCAC 3100

ATAAATTATG CCCGGAAT 3118



2) INFORMATION FOR SEQ ID NO: 689 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1836 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plasmodium falciparum 
(B) STRAIN: 3D7

(C) ACCESSION NUMBER: L08200 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 689 

ATGACAAAAG TTGCTGTTGA AAAAGAGGAA CCAGGAGTTG TTTATAAGGT 50

GGCTGGTTCA TTAGTTATTG CTGAGAATAT GAGTGGAACT CGTATGTACG 100 

AGTTAGCTAA AGTAGGATGG AATAAATTGG TTGGAGAAAT TATTAGATTA 150 

GAAGGGAATT ATGCATATAT ACAAGTTTAT GAAGATACTT CAGGTTTATC 200 

TGTAGGAGAC CCTGTTATAA AAACAGGAAA TGCTTTATCA GTCGAATTGG 250 

GTCCTGGTAT TTTAGATAAT ATTTATGATG GTATTCAAAG ACCATTAGAA 300

AGAATAGCAA ATGTGTGTGG TGATGTATAT ATATATAAAG GTATTGATAT 350 

GACATCTTTA GATCATGATA AACAATGGCA ATTTTATGCT GATAAGAAAT 400

TAAAATTAAA TGATATTGTT ACTGGTGGAG ATATCTTTGG ATTTGTTGAT 450 

GAAAATAAAT TATTTAAAGA ACACAAAATT ATGGCTCCAC CTAATGCTAA 500 

AGGGAGGCTT ACATATATTG CTCCAGATGG ATCATATACT TTAAAAGATA 550

AAATATTTGA ATTAGAATAT CAAGGAAAAA AATATACATA TSGTTTATCT 600

CATTTATGGC CTGTTCGTGA TCCTAGACCT GTTTTAGAAA AGGTAACAGG 650 

GGATACTTTA TTATTAACAG GGCAAAGAGT TTTAGATTCG TTATTTCCAA 700 

CAGTTCAAGG AGGTACTTGT GCTATTCCTG GTGCATTTGG TTGTGGAAAA 750 

ACTTGTGTTT CTCAGGCCTT ATCAAAATAT TCTAATAGTG AAGTTATTAT 800

ATATGTAGGA TGTGGTGAAA GAGGTAATGA AATGGCTGAA ATTTTATCCG 850 

ACTTTCCTGA ATTAACTACT AAAGTAGATA ATGAAGATGT AGGTATTATG 900 

CAAAGAACGT GTTTAGTTGC TAATACTTCT AACATGCCTG TCGCTGCAAG 950 

AGAAGCTAGT ATTTATACAG GTATTACTTT ATGTGAATAT TTCCGTGATA 1000 

TGGGTTATAA TGCTACCATG ATGGCTGATA GTACAAGTAG ATGGGCAGAA 1050

GCCTTAAGAG AAATTTCAGG ACGTTTAGCT GAAATGCCTG CAGATAGTGG 1100 

TTATCCAGCT TATTTAGGTG CTAGATTAGC TTCCTTTTAT GAACGTGCAG 1150 

GAAAAGTCAA ATGTATTGGT TCTCCATCTC GTATAGGATC CATTACAATT 1200 

GTGGGTGCTG TGTCTCCACC AGGTGGTGAT TTCTCTGACC CTGTAACTAC 1250 

AGCAACCATG TCTATTGTTC AAGCATTTTG GGGGTTAGAT AAAAAACTAG 1300

CTCAAAGAAA ACATTTCCCT TCTGTTAATT GGTCTACATC CTTTTCAAAG 1350 

TATGTCAGAC AATTAGAACA ATACTTTGAT AATTTTGATC AAGATTTCTT 1400 

ATCTTTAAGA CAAAAAATTA GTGATATTTT ACAACAAGAA AGTGACTTGA 1450 

ATGATATTGT TCAACTAGTA GGAAAGGATT CATTATCAGA AGACCAAAAA 1500 

GTTGTTATGG AAGTAGCCAA AATTATTAGA GAAGATTTTC TTCAACAAAA 1550

TGCATTTAGC GATTATGATT ATATGTGCCC ATTACAAAAA ACAGTTGGTA 1600 

TGATGAGAAT TATTTGCCAC TTTTATGCTC AATGCTTAAG AACATTACAA 1650 

GAATATGACT CAAGAGAAAG AAAAATTGGT TGGGGATCTA TATATAATAC 1700 

ATTAAGACCA ACTATAAATA AAATTACACA TATGAAATTT GAAAACCCAA 1750 

AAAATTCAGA TGAATATTTC AAAAAGTATT TTAAGGCACT TGAAGAAGAA 1800

ATAACAGTAG GTTTAAGAAA CTTGATGGAA AAATGA 1836 



2) INFORMATION FOR SEQ ID NO: 690

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3216 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Saccharomyces cerevisiae 

(B) STRAIN: X2180-1A 

(C) ACCESSION NUMBER: J05409 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 690

ATGGCTGGTG CAATTGAAAA CGCTCGTAAG GAAATAAAAA GAATCTCATT 50 

AGAAGACCAT GCTGAATCTG AATATGGTGC CATCTATTCT GTCTCTGGTC 100 

CGGTCGTCAT TGCTGAAAAT ATGATTGGTT GTGCCATGTA CGAATTGGTC 150 

AAGGTCGGTC ACGATAACCT GGTGGGTGAA GTCATTAGAA TTGACGGTGA 200

CAAGGCCACC ATCCAAGTTT ACGAAGAAAC TGCAGGCCTT ACGGTCGGTG 250 

ACCCTGTTTT GAGAACAGGT AAGCCTCTGT CGGTAGAATT GGGTCCTGGT 300 

CTGATGGAAA CCATTTACGA TGGTATTCAA AGACCTTTGA AAGCCATTAA 350 

GGAAGAATCG CAATCGATTT ATATCCCAAG AGGTATTGAC ACTCCAGCTT 400 

TGGATAGGAC TATCAAGTGG CAATTTACTC CGGGAAAGTT TCAAGTCGGC 450

GATCATATTT CCGGTGGTGA TATTTACGGT TCCGTTTTTG AGAATTCGCT 500 

AATTTCAAGC CATAAGATTC TTTTGCCACC AAGATCAAGA GGTACAATCA 550 

CTTGGATTGC TCCAGCTGGT GAGTACACTT TGGATGAGAA GATTTTGGAA 600 

GTTGAATTTG ATGGCAAGAA GTCTGATTTC ACTCTTTACC ATACTTGGCC 650 

TGTTCGTGTT CCAAGACCAG TTACTGAAAA GTTATCTGCT GACTATCCTT 700

TGTTAACAGG TCAAAGAGTT TTGGATGCTT TGTTTCCTTG TGTTCAAGGT 750

GGTACGACAT GTATTCCAGG TGCTTTTGGT TGTGGTAAGA CCGTTATCTC 800 

TCAATCTTTG TCCAAGTACT CCAATTCTGA CGCCATTATC TATGTCGGGT 850 

GCTTTGCCAA GGGTACCAAT GTTTTAATGG CGGATGGGTC TATTGAATGT 900 

ATTGAAAACA TTGAGGTTGG TAATAAGGTC ATGGGTAAAG ATGGCAGACC 950

TCGTGAGGTA ATTAAATTGC CCAGAGGAAG AGAAACTATG TACAGCGTCG 1000 

TGCAGAAAAG TCAGCACAGA GCCCACAAAA GTGACTCAAG TCGTGAAGTG 1050 

CCAGAATTAC TCAAGTTTAC GTGTAATGCG ACCCATGAGT TGGTTGTTAG 1100 

AACACCTCGT AGTGTCCGCC GTTTGTCTCG TACCATTAAG GGTGTCGAAT 1150 

ATTTTGAAGT TATTACTTTT GAGATGGGCC AAAAGAAAGC CCCCGACGGT 1200

AGAATTGTTG AGCTTGTCAA GGAAGTTTCA AAGAGCTACC CAATATCTGA 1250 

GGGGCCTGAG AGAGCCAACG AATTAGTAGA ATCCTATAGA AAGGCTTCAA 1300 

ATAAAGCTTA TTTTGAGTGG ACTATTGAGG CCAGAGATCT TTCTCTGTTG 1350 

GGTTCCCATG TTCGTAAAGC TACCTACCAG ACTTACGCTC CAATTCTTTA 1400 

TGAGAATGAC CACTTTTTCG ACTACATGCA AAAAAGTAAG TTTCATCTCA 1450

CCATTGAAGG TCCAAAAGTA CTTGCTTATT TACTTGGTTT ATGGATTGGT 1500 

GATGGATTGT CTGACAGGGC AACTTTTTCG GTTGATTCCA GAGATACTTC 1550 

TTTGATGGAA CGTGTTACTG AATATGCTGA AAAGTTGAAT TTGTGCGCCG 1600 

AGTATAAGGA CAGAAAAGAA CCACAAGTTG CCAAAACTGT TAATTTGTAC 1650 

TCTAAAGTTG TCAGAGGTAA TGGTATTCGC AATAATCTTA ATACTGAGAA 1700

TCCATTATGG GACGCTATTG TTGGCTTAGG ATTCTTGAAG GACGGTGTCA 1750 

AAAATATTCC TTCTTTCTTG TCTACGGACA ATATCGGTAC TCGTGAAACA 1800 

TTTCTTGCTG GTCTAATTGA TTCTGATGGC TATGTTACTG ATGAGCATGG 1850 

TATTAAAGCA ACAATAAAGA CAATTCATAC TTCTGTCAGA GATGGTTTGG 1900 

TTTCCCTTGC TCGTTCTTTA GGCTTAGTAG TCTCGGTTAA CGCAGAACCT 1950

GCTAAGGTTG ACATGAATGG CACCAAACAT AAAATTAGTT ATGCTATTTA 2000 

TATGTCTGGT GGAGATGTTT TGCTTAACGT TCTTTCGAAG TGTGCCGGCT 2050 

CTAAAAAATT CAGGCCTGCT CCCGCCGCTG CTTTTGCACG TGAGTGCCGC 2100 

GGATTTTATT TCGAGTTACA AGAATTGAAG GAAGACGATT ATTATGGGAT 2150 

TACTTTATCT GATGATTCTG ATCATCAGTT TTTGCTTGCC AACCAGGTTG 2200

TCGTCCATAA TTGCGGAGAA AGAGGTAATG AAATGGCAGA AGTCTTGATG 2250 

GAATTCCCAG AGTTATATAC TGAAATGAGC GGTACTAAAG AACCAATTAT 2300 

GAAGCGTACT ACTTTGGTCG CTAATACATC TAACATGCCG GTTGCAGCCA 2350 

GAGAAGCTTC TATTTACACT GGTATCACTC TTGCAGAATA CTTCAGAGAT 2400 

CAAGGTAAAA ATGTTTCTAT GATTGCAGAC TCTTCTTCAA GATGGGCTGA 2450

AGCTTTGAGA GAAATTTCTG GTCGTTTGGG TGAGATGCCT GCTGATCAAG 2500 

GTTTCCCAGC TTATTTGGGT GCTAAGTTGG CCTCCTTTTA CGAAAGAGCC 2550 

GGTAAAGCTG TTGCTTTAGG TTCCCCAGAT CGTACTGGTT CCGTTTCCAT 2600 

CGTTGCTGCC GTTTCGCCAG CCGGTGGTGA TTTCTCAGAT CCTGTTACTA 2650 

CTGCTACATT GGGTATCACT CAAGTCTTTT GGGGTTTAGA CAAGAAATTG 2700

GCTCAAAGAA AGCATTTCCC ATCTATCAAC ACATCTGTTT CTTACTCCAA 2750 

ATACACTAAT GTCTTGAACA AGTTTTATGA TTCCAATTAC CCTGAATTTC 2800 

CTGTTTTAAG AGATCGTATG AAGGAAATTC TATCAAACGC TGAAGAATTA 2850 

GAACAAGTTG TTCAATTAGT TGGTAAATCG GCCTTGTCTG ATAGTGATAA 2900 

GATTACTTTG GATGTTGCCA CTTTAATCAA GGAAGATTTC TTGCAACAAA 2950

ATGGTTACTC CACTTATGAT GCTTTCTGTC CAATTTGGAA GACATTTGAT 3000 

ATGATGAGAG CCTTCATCTC GTATCATGAC GAAGCTCAAA AAGCTGTTGC 3050 

TAATGGTGCC AACTGGTCAA AACTAGCTGA CTCTACTGGT GACGTTAAGC 3100 

ATGCCGTTTC TTCATCTAAA TTTTTTGAAC CAAGCAGGGG TGAAAAGGAA 3150 

GTCCATGGCG AATTCGAAAA ATTGTTGAGC ACTATGCAAG AAAGATTTGC 3200

TGAATCTACC GATTAA 3216 



2) INFORMATION FOR SEQ ID NO: 691

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1860 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Schizosaccharomyces pombe

(B) STRAIN: 972 h- 

(C) ACCESSION NUMBER: S47814 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691

ATGGCGGGAG GAATTGAACT GGCCAAGAAG GCTATCAGGA GCCTCAAAAA 50 

TTACGACGAG CATGAAAACC GATATGGATC TATTTTCAGC GTTTCTGGTC 100 

CTGTCGTTGT TGCAGCCAAT ATGCTTGGAT GTTCGATGTA CGAACTCGTT 150 

CGCGTTGGTC ATGAAGAACT AGTTGGTGAA GTAATTCGTA TCCATCAAGA 200

TAAATGTACT ATTCAAGTAT ACGAAGAGAC GTCCGGTCTC ACTGTTGGTG 250 

ATCCTGTCCA ACGCACTGGA AAGCCATTAT CTGTTGAATT AGGTCCTGGT 300 

TTAGCTGAGA CTATTTATGA TGGTATCCAA CGTCCGTTAA AGCAAATTTT 350 

CGACAAATCT CAAAGTATTT ATATTCCTAG AGGTATTAAT ACAGAATCAC 400 

TTAATCGTGA GCATAAGTGG GATTTCACAC CAAATAAGGA TTTACGCATT 450

GGCGATCATG TATCCGGTGG TGATGTTTTC GGTTCTGTAT TTGAAAACTC 500 

TCTTTTCAAT GATCATAAAA TTATGTTACC CCCTAGAGCC CGTGGTACCG 550 

TCACATATAT TGCTGAAGCT GGATCATACC ATGTTGATGA AAAACTTCTT 600 

GAAGTCGAGT TTAATGGCAA GAAACATTCT TTTAGTATGT TGCATACTTG 650 

GCCTGTCCGT GCTGCTCGTC CAGTTGCGGA CAACTTAACT GCTAATCAAC 700

CTTTATTGAC TGGTCAACGT GTTTTGGATG CGTTATACCC CTGTGTTCAA 750 

GGTGGCACTA CTGCTATCCC CGGTGCCTTT GGTTGTGGTA AAACAGTTAT 800 

TTCACAATCT CTTTCTAAGT ACTCTAATTC TGATTTGATT GTTTACGTCG 850 

GTTGTGGTGA ACGTGGAAAC GAAATGGCAG AAGTGTTAAT GGATTTCCCA 900 

GAACTAACAA TTGATATTAA TGGTAAACCA GAGCCCATTA TGAAGCGTAC 950

TACATTGGTA GCCAACACTT CTAACATGCC TGTCGCTGCT CGTGAAGCTT 1000 

CCATTTATAC CGGTATTACA CTTGCTGAAT ATTATCGTGA TCAAGGTAAG 1050 

AACGTTTCAA TGATGGCTGA TTCTACATCT CGTTGGGCTG AAGCTTTGCG 1100 

TGAAATTTCT GGTCGTTTGG CTGAGATGCC TGCCGATTCT GGTTATCCCG 1150 

CTTATTTGGG TGCCAAATTG GCTTCTTTTT ACGAACGTGC TGGTCGTGCT 1200

CGTTGCTTGG GAAGTCCTGA CCGTGAAGGA ACAGTTTCAA TTGTTGGAGC 1250 

TGTTTCTCCT CCGGGTGGTG ATTTTTCTGA TCCTGTTACT AGTGCAACCT 1300 

TGGGAATTGT TCAAGTCTTC TGGGGTTTGG ACAAGAAATT GGCCCAACGT 1350 

AAACACTTTC CCTCAATCAA CACCTCTCTT TCCTATTCTA AATACATCAA 1400

TGCTTTGCAA CCTTGGTATG AGGAAAGAGT TCCAGGCTTT AATACTCTTC 1450

GTGATCAAAT CAAACAGATC ATTCAACAAG AAGATTCCAT GTTGGAAATT 1500

ATTCAGTTGG TTGGTAAGTC GGCTCTTTCT GAAACGGATA AAGTTACTTT 1550 

GGACATAGCC GGTATTATTA AGAATGACTT CTTACAACAA AACGGTTATT 1600 

CTGATTACGA TCGCTGTTGC CCTCTTTACA AGACTTATCA TATGATGCGA 1650 

AACATGATTG CTTACTACAC AAAGGCTAAA AGTGCCGTTG AAACTGGTAG 1700

CGTTCCTTGG TCAAAGATTA AAGAAAGTAC TTCAGATATC TTTTATGAGT 1750 

TAACCTCGAT GAAATTCGAA AACCCTAATG AAGGCGAGAA GGAAATAGTC 1800 

GAACACTATG AAACTCTGCA CAAGAAGATT GAGGACAAGT TTCACACTCT 1850 

GACTGAGTAA 1860



2) INFORMATION FOR SEQ ID NO: 692 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma congolense 
(B) STRAIN: IL3000

(C) ACCESSION NUMBER: Z25814 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 692 

ATGACGAGCG ATAAAAACCC TTACAAAACA GAGCAGCGCA TGGGGGCCGT 50

GAAGGCCGTC TCCGGGCCAG TTGTCATTGC TGAAAACATG GGCGGTAGCG 100 

CTATGTATGA GCTTGTGCAG GTAGGTTCCT TCCGGTTAGT GGGCGAGATC 150 

ATTCGTCTAG AGGGCGATAC CGCCACTATT CAGGTCTATG AGGAAACAGG 200 

TGGCCTCACT GTCGGAGACC CGGTGTACTG TACGGGTAAG CCTCTTTCGC 250 

TTGAGCTTGG ACCTGGAATC ATGTCTGAAA TATTTGACGG TATCCAGCGG 300

CCTCTTGACA CCATCTACCG CATGGTGGAA AACGTGTTTA TCCCCAGGGG 350 

CGTTCAGGTG AAGTCACTCA ATGACCAGAA ACAGTGGGAC TTTAAGCCAT 400

GCCTGAAGGT TGGAGATCTT GTGTCTGGTG GTGATATCAT TGGCTCAGTG 450

GTGGAGAACT CTCTCATGTA CAATCACAGC ATTATGATTC CGCCCAATGT 500 

GCGGGGCCGT GTTACTTCCA TTGTTCCTTC AGGAAATTAC ACCCTCCAAG 550

ATGACATTAT TGAATTGGAA TATAATGGGA CAGTGAAATC ACTAAAACTT 600 

ATGCATCGCT GGCCAGTACG GACCCCGCGT CCTGTGGCGT CAAAAGAATC 650 

CGGCAATCAT CCGCTTCTCA CCGGACAGCG TGTGCTCGAT GCTCTCTTTC 700 

CATCCGTCCA GGGTGGAACA TGCGCCATCC CTGGCGCGTT TGGATGCGGA 750 

AAGACGGTTA TCAGTCAGGC TCTTTCGAAG TTCTCCAACA GCGACGCTGT 800

TATCTATGTC GGCTGCGGCG AGCGTGGGAA TGAGATGGCA GAGGTGCTCA 850 

TGGACTTCCC CACACTCACC ACCGTTATTG ATGGTCGTGA GGAGTCCATC 900 

ATGAAGCGTA CCTGCCTGGT GGCAAACACC TCAAATATGC CTGTCGCTGC 950 

TCGTGAGGCG TCTATTTACA CTGGCATCAC TTTAGCTGAG TATTATCGTG 1000 

ATATGGGCAA GCACATTGCT ATGATGGCCG ACTCTACCTC TCGATGGGCT 1050

GAGGCTCTCC GTGAGATCTC TGGGCGTCTC GCTGAAATGC CCGCTGATGG 1100 

TGGTTACCCT GCGTACCTCA GTGCGCGTCT TGCTTCCTTC TACGAGCGTG 1150 

CGGGGCGCGT GACATGCATC GGTGGGCCAA AACGCGAGGG CTCAGTAACC 1200

ATCGTTGGTG CCGTTTCTCC TCCTGGAGGT GACTTTTCTG ACCCAGTGAC 1250

GTCCGCTACG CTTGGTATTG TGCAAGTCTT TTGGGGTCTT GAGAAGCGTC 1300

TTGCGCAACG TAAACACTTT CCTTCTGTTA ATTGGCTCAT TTCCTATTCA 1350

AAATACCTTA ATGCTTTGGA GCCCTTCTTC AACACGCTTG ACCCTGACTA 1400

CATGCGCCTG CGGTCAGTTG CTGCGGAGAT CCTTCAGCGT GAGGAAGAGT 1450 

TGCAAGAAAT TGTTCAACTT GTCGGTAAGG ACTCACTTTC GGAGTCTGAC 1500 

AAAATTATTC TAGAAACGGC TAAGGTTATT CGTGAAGAGT TTCTCCAGCA 1550

GAATGCCTTT ACGCCGTACG ACAAGTATTG CCCGCCGTAC AAGACCTGCT 1600 

GGATGCTACG TAACATTGTC GCGTTCTACG AGGAGAGCCA GCGCGTTGTA 1650 

GCTGAGTCCG CTGGGGAACT TAAGATTACG TGGAACTACA TTCGTGAAAT 1700 

GATTCCTCAT ATTTACACGG GTTTAACTGA GATGAAGTTC CGTGATCCTC 1750 

AGGAGGGTGA GGAGGCCAAC GTAGAATTCT ACAGAAAACA AAATGAGGAA 1800

ATTGTCAGCG CATTCGCCTC GCTGCTGCAA TAA 1833 



2) INFORMATION FOR SEQ ID NO: 693

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1758 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Thermus thermophilus

(B) STRAIN: HB8 

(C) ACCESSION NUMBER: D63799 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693

ATGATCCAAG GGGTGATCCA GAAGATCGCG GGCCCGGCGG TGATCGCCAA 50 

GGGCATGCTC GGGGCCCGCA TGTACGACAT CTGCAAGGTG GGCGAAGAGG 100 

GCCTCGTGGG CGAGATCATC CGCCTGGACG GGGACACGGC CTTCGTCCAG 150 

GTCTACGAGG ACACCTCGGG CCTAAAGGTG GGGGAGCCCG TGGTCTCCAC 200

GGGCCTTCCC TTGGCGGTGG AGCTCGGCCC CGGGATGCTG AACGGCATCT 250 

ACGACGGCAT CCAGCGCCCC CTGGAGCGCA TCCGGGAGAA GACGGGGATC 300 

TACATCACCC GGGGCGTGGT GGTCCACGCC CTGGACCGGG AGAAGAAGTG 350 

GGCCTGGACG CCCATGGTCA AGCCCGGGGA CGAGGTGCGG GGGGGTATGG 400

TCCTGGGCAC GGTGCCCGAG TTCGGCTTCA CCCACAAGAT CCTGGTACCC 450

CCGGACGTGC GGGGCCGGGT CAAGGAGGTG AAGCCCGCCG GGGAGTACAC 500 

CGTGGAGGAG CCGGTGGTGG TCCTCGAGGA CGGCACCGAG CTCAAGATGT 550 

ACCACACCTG GCCCGTTCGC CGGGCGAGGC CCGTGCAAAG GAAGCTTGAC 600 

CCCAACACCC CCTTCCTCAC GGGGATGCGC ATCCTGGACG TCCTCTTCCC 650 

CGTGGCCATG GGGGGCACCG CCGCCATCCC TGGGCCCTTC GGCAGCGGCA 700

AGACCGTGAC CCAGCAGTCC CTGGCCAAGT GGTCCAACGC CGACGTGGTG 750 

GTCTACGTGG GCTGCGGGGA GCGGGGGAAC GAGATGACCG ACGTGCTCGT 800 

GGAGTTCCCC GAGCTCACCG ACCCCAAGAC GGGTGGGCCC TTGATGCACC 850 

GCACCGTCCT CATCGCCAAC ACCTCCAACA TGCCCGTGGC CGCCCGCGAG 900 

GCCAGCATCT ACGTGGGCGT GACCATCGCC GAGTACTTCC GCGACCAGGG 950

CTTCTCCGTG GCCCTCATGG CCGACTCCAC GAGCCGCTGG GCCGAGGCTT 1000 

TGCGCGAGAT CTCTAGCCGC CTCGAGGAGA TGCCCGCCGA GGAGGGCTAC 1050 

CCGCCCTACC TCGCCGCCAG GCTCGCCGCC TTCTACGAGC GGGCGGGCAA 1100 

GGTCATCACC CTGGGCGGCG AGGAGGGGGC GGTGACCATC GTGGGGGCCG 1150 

TCTCCCCGCC GGGCGGCGAC ATGTCCGAGC CCGTGACCCA GTCCACCTTG 1200

AGGATCGTGG GGGCCTTCTG GCGGCTTGAC GCCTCCCTGG CCTTCCGCCG 1250 

CCACTTCCCC GCCATCAACT GGAACGGCTC CTACAGCCTC TTCACCTCCG 1300 

CCCTTGACCC CTGGTACCGG GAGAACGTGG CCGAGGACTA CCCCGAGCTC 1350 

CGCGACGCCA TCTCCGAGCT TTTGCAGCGG GAGGCGGGCC TCCAGGAGAT 1400

CGTCCAGCTC GTGGGGCCGG ACGCCCTCCA GGACGCCGAG CGCCTCGTCA 1450

TTGAGGTGGG CCGGATCATC CGCGAGGACT TCCTGCAGCA GAACGCCTAC 1500 

CACGAGGTGG ACGCCTACTG CTCCATGAAG AAGGCCTACG GGATCATGAA 1550 

GATGATCCTC GCCTTCTACA AGGAGGCGGA GGCGGCCATC AAGCGGGGGG 1600 

TTTCCATAGA CGAGATCCTG CAGCTCCCCG TTCTGGAGCG CATCGGCCGC 1650 

GCCCGCTACG TGAGCGAGGA GGAGTTCCCC GCCTACTTTG AGGAGGCCAT 1700

GAAGGAGATC CAGGGGGCCT TCAAGGCTGG CCTAAAGGGG GAGAGATGGA 1750 

CCTTCTGA 1758 






2) INFORMATION FOR SEQ ID NO: 694 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 694 
CGGCGCIATC YTSGTTGTTG C 21

2) INFORMATION FOR SEQ ID NO: 695

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695 
GTTTCACGTG ATGACGTACA 20 

2) INFORMATION FOR SEQ ID NO: 696

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696 

ATIGGICAYR TIGAYCAYGG IAARAC 26






2) INFORMATION FOR SEQ ID NO: 697 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697 


CCIACIGTIC KICCRCCYTC RCG 23 
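
The degenerate primers in this listing, such as SEQ ID NO: 697 above, combine IUPAC ambiguity codes (R, Y, K, S, W and so on) with I for inosine. The following Python snippet is a minimal illustrative sketch and not part of the original listing: it checks a primer against its stated LENGTH and counts how many distinct oligonucleotides the ambiguity codes imply. The function names are hypothetical, and counting inosine as a single non-degenerate position is an assumption made here for illustration.

```python
# Number of bases each IUPAC nucleotide code can stand for.
# Assumption: "I" (inosine) is counted as one fixed position, because it is
# a single synthesized base rather than a mixture of bases.
IUPAC_DEGENERACY = {
    "A": 1, "C": 1, "G": 1, "T": 1,
    "R": 2, "Y": 2, "S": 2, "W": 2, "K": 2, "M": 2,
    "B": 3, "D": 3, "H": 3, "V": 3,
    "N": 4,
    "I": 1,
}


def clean(seq: str) -> str:
    """Remove the spaces that group bases in blocks of ten and upper-case."""
    return "".join(seq.split()).upper()


def primer_length(seq: str) -> int:
    """Return the number of bases, for comparison with the LENGTH field."""
    return len(clean(seq))


def primer_degeneracy(seq: str) -> int:
    """Return how many distinct sequences the ambiguity codes represent."""
    total = 1
    for base in clean(seq):
        total *= IUPAC_DEGENERACY[base]  # KeyError on an unexpected symbol
    return total


seq_697 = "CCIACIGTIC KICCRCCYTC RCG"  # SEQ ID NO: 697 as printed above
print(primer_length(seq_697))      # 23, matching LENGTH: 23 bases
print(primer_degeneracy(seq_697))  # 16 (K, R, Y and R each contribute 2)
```

An unexpected character raises a KeyError, which also makes the sketch a cheap way to flag scanning errors in a primer line.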



2) INFORMATION FOR SEQ ID NO: 698

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Escherichia coli

(C) ACCESSION NUMBER: extracted from J01690 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698 



GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTTA ACGTTGGTAC 50

TATCGGCCAC GTTGACCACG GTAAAACTAC TCTGACCGCT GCAATCACCA 100

CCGTACTGGC TAAAACCTAC GGCGGTGCTG CTCGTGCATT CGACCAGATC 150

GATAACGCGC CGGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCTCA 200

CGTTGAATAC GACACCCCGA CCCGTCACTA CGCACACGTA GACTGCCCGG 250

GGCACGCCGA CTATGTTAAA AACATGATCA CCGGTGCTGC TCAGATGGAC 300

GGCGCGATCC TGGTAGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 350

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 400

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 450

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 500

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 550

AGTGGGAAGC GAAAATCCTG GAACTGGCTG GCTTCCTGGA TTCTTATATT 600

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 650

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 700

GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG 750

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 800

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 850

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG 900

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 950

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 1000

GTACTACTGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 1050

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 1100

CGCGATGGAC GACGGTCTGC GTTTCGCAAT CCGTGAAGGC GGCCGTACCG 1150

TTGGCGCGGG CGTTGTTGCT AAAGTTCTGG GCTAA 1185






2) INFORMATION FOR SEQ ID NO: 699 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 699 
GTIACIGGYT CYTYRARRTT ICCICC 26



2) INFORMATION FOR SEQ ID NO: 700 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 700 

2) INFORMATION FOR SEQ ID NO: 701

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 701
GTGTTCACGA TCATCGATGC G 



2) INFORMATION FOR SEQ ID NO: 702 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 702 
CTCTCGATAT CCGCGAAGCG 



2) INFORMATION FOR SEQ ID NO: 703 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 703 
TATGGAAATT CGAAACATCT 



2) INFORMATION FOR SEQ ID NO: 704 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 704 
AGTGCTCCAA TTAATGTTGG 20



2) INFORMATION FOR SEQ ID NO: 705 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 705 


GTACAGTTCC AATACCTGAA 20 



2) INFORMATION FOR SEQ ID NO: 706

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706

TGAAATCTTC ACATCCAACA 20 






2) INFORMATION FOR SEQ ID NO: 707 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707 
TWACCATTTC AGTACCTTCT GGTAA 25 

2) INFORMATION FOR SEQ ID NO: 708 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 708 
TCRTCCATIC CIARIATIGC IATIAT 



2) INFORMATION FOR SEQ ID NO: 709 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Borrelia burgdorferi 

(C) ACCESSION NUMBER: extracted from AE001122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 709 

ATGAATGAAG TTTTATTTGT AAAGACTGCT GGTAGGAATT TAAAAGCAGA 50 

AGTAATTCGT ATTAGGGGCA ATGAAGTTGA TGCACAGGTT TTTGAATTGA 100 

CAAAAGGGAT ATCTGTTGGA GACCTAGTTG AATTTACAGA CAAACTTTTA 150 

ACAGTTGAAC TCGGACCAGG GCTTTTAACT CAAGTATATG ATGGGCTTCA 200 

AAATCCTTTG CCTGAATTGG CTATTCAATG TGGATTTTTT TTAGAAAGGG 250 

GAGTATATTT AAGGCCCTTG AATAAAGATA AAAAGTGGAA TTTTAAAAAA 300

ACCTCCAAAG TTGGAGATAT CGTTATTGCA GGAGATTTTT TAGGTTTTGT 350

AATTGAGGGA ACTGTTCACC ATCAAATAAT GATTCCATTT TATAAAAGGG 400

ATTCTTATAA AATTGTGGAG ATTGTAAGTG ATGGCGACTA TTCGATTGAT 450

GAGCAAATTG CTGTAATTGA AGATGATTCT GGTATGAGGC ATAATATTAC 500 

AATGTCTTTT CATTGGCCTG TTAAAGTTCC TATTACTAAT TATAAGGAAC 550 

GCCTTATTCC TAGTGAACCT ATGTTGACTC AAACTAGAAT TATAGATACA 600 

TTTTTCCCAG TTGCCAAAGG TGGAACTTTT TGCATTCCGG GTCCTTTTGG 650 

AGCAGGAAAA ACGGTTCTTC AGCAGGTTAC AAGTCGAAAT GCTGATGTTG 700 

ATGTAGTGAT TATTGCAGCT TGTGGTGAGC GAGCAGGAGA AGTGGTAGAA 750 

ACTCTTAAAG AATTTCCCGA ATTAATGGAT CCAAAAACCG GCAAATCTTT 800 

AATGGACAGG ACTTGTATTA TTTGTAATAC ATCTTCAATG CCAGTTGCAG 850 

CTAGAGAAGC TTCTGTTTAT ACTGCTATTA CTATTGGTGA GTATTACAGG 900 

CAAATGGGCC TTGATATTCT TCTTTTGGCA GATTCAACTT CAAGATGGGC 950 

TCAAGCAATG AGAGAAATGT CTGGACGCCT TGAGGAAATT CCTGGCGAGG 1000 

AGGCTTTTCC GGCATATCTT GAGTCTGTTA TTGCTTCCTT TTATGAAAGG 1050 

GCAGGTATTG TAGTTCTTAA TAATGGGGAT ATTGGATCTG TAACAGTTGG 1100 

TGGCTCTGTA AGTCCTGCTG GTGGTAATTT TGAAGAGCCA GTTACTCAAG 1150 

CAACTTTAAA AGTTGTAGGA GCATTTCACG GGCTTACAAG AGAAAGGTCT 1200 

GATGCTAGGA AATTTCCAGC TATTAGTCCT CTTGAATCTT GGAGTAAATA 1250

TAAAGGCGTT ATTGATCAAA AAAAGACTGA ATATGCAAGA TCTTTTTTGG 1300

TGAAAGGTAA TGAAATTAAT CAAATGATGA AAGTTGTTGG AGAAGAAGGC 1350

ATAAGTAACG ATGATTTTTT AATTTATTTA AAATCCGAGC TACTTGATTC 1400

GTGCTATTTG CAGCAAAATT CATTTGATTC TATTGATGCT GCTGTTAGTT 1450

CAGAGCGTCA AAATTATATG TTTGATATAG TTTATAACAT TCTTAAAACT 1500 

AACTTTGAGT TTTCTGATAA ACTTCAAGCA AGAGATTTTA TAAATGAGTT 1550 

AAGGCAAAAT CTTTTAGACA TGAATCTTTC TTCTTTTAAG GATCATAAGT 1600 
TTAATAAATT GGAGCATGCT TTGGGTGAAT TGATAAATTT TAAAAAG^TA 1650

ATTTAG 1656

2) INFORMATION FOR SEQ ID NO: 710

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Treponema pallidum 

(B) STRAIN: Nichols 

(C) ACCESSION NUMBER: extracted from AE000520



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 710 



GTGATCAAAG ACGATGTGGT TACAGGCCGT GTAGTGAGGG TGTCTGGTCC 50

CATTGTGTAT GCCGAGGGCC TCTCTGCGTG CAGCGTATAC GATGTTGTCG 100

ACGTAGGGGA AGCATCGCTC ATCGGAGAAA TTATCCGGTT GGATGAGAGC 150

AAGGCGGTCG TGCAAGTATA CGAGGATGAC ACAGGTATGC GAGTCGGGGA 200

GAAGGTGACA AGCTTGCGTC GACCACTCTC AGTCCGCTTA GGGCCTGGAT 250

TAATCGGCAC CATTTATGAC GGTATTCAGC GCCCACTTGA GCGCCTCTTC 300

CAAGAAGACG GCGCCTTCTT GCGTCCTGGT GCGCGTTCAC AACCGCTTGA 350

TGGCTCCGTA CGCTGGGATT TTCGTCCTCA TTGTAACGAG CGCGGTGAGG 400

CCCTGTGCGC GGGGATTCCG ATTGCACCTG GGTCAGTGTT AGGGACCGTG 450

CAGGAGACTC CTTCTGTTGT GCACACTATC ATGGTTCCTC CTGACATCCG 500

GGGGAGCGTG CTATCTTCGT TCAAGGGCGC AGGTGCTTAC ACAATAGATG 550

AAGAAATTGG ACGCACTGAT CTTGGTGAGC CGCTTTTTCT ATCCCAGTAC 600

TGGCCAGTGC GTCGTGCGCG TCCTTTCAGC AAAAAACTTG CAGTGTGTGA 650

GCCACTAGTT ACTGGACAGC GGGCGATTGA TGTTTTCTTC CCCCTATCAA 700

AGGGAGGAAC GGCGGCTATT CCAGGGGGAT TTGGAACTGG GAAGACAATG 750

ACGCAGCATG CCGTTGCCAA GTGGTGTGAT GCAGATATTA TCGTGTACAT 800

CGGCTGCGGA GAGCGGGGCA ACGAGATGAC AGACGTGCTC TCTGAATTTC 850

CCAAACTCAT CGATCCGCGC ACAGGACGCT CTCTTATGGA GCGGACGATT 900

TTGATCGCAA ATACGTCCAA TATGCCTGTG TCCGCACGCG AGGTGTCGCT 950

GTATTCAGGG ATTACCCTTG CGGAATACTA CCGTGATATG GGTATGCATG 1000

TGGCCATCAT GGCTGATTCT ACCAGCCGCT GGGCGGAGGC GCTGCGTGAA 1050

TTGTCTGGGC GCATGGAAGA AATGCCTGCG GAGGAGGGAT TCCCTGCGTA 1100

CCTTCCGACG CGTCTTGCAG AATTTTATGA GCGCGCAGGA CGCGTGGAAA 1150

CCTGTGTGGC GCGCGAGGGC TCTGTGAGCA TCATTGGTGC TGTTTCTCCC 1200

CTGGGTGGAG ATTTCTCTGA GCCGGTGACG CAGCACACAA AGCGCTTCAT 1250

CCGTTGCTTT TGGGCCTTGG ATCGTGAACT TGCACACGCG CGTCATTACC 1300

CTGCCATTGG GTGGATAGAT TCATACTCTG AATATGCGCA GGAAGTAAGT 1350

GCATGGTGGA GTAAGTATGA CCCGCGCGCA GGCGCGTTGC GCGCCGCAGC 1400

CTTGGATTTG CTGAGAAAGG AACAGCGGTT ACAGCAAATT GTCAGGCTTG 1450

TCGGTCCTGA TGCGCTGCCT GGAGAAGATC GTCTGGTGCT AATGGTGTGT 1500

GAAATGATCA AAGGTGGCTT TCTGCAGCAG AACGCTTTTG ATCCGACGGA 1550

TGTGTTCTCC TGTCCCGAAA AGCAGGTGCA GATCTTGCGT ACCATAGTGG 1600

ATTTTCACGA ACGTGCCGTG GTGCTGCTGC GTGCAGGTAT TTCGCTTTCT 1650

GCGCTGTCCC AGCTTTCGTG CCGGGAGCTC ATCGTACGTA TGAAAACTAC 1700

GTACGGGAAT GAGGATGTAC ACAAGATGCA GAAAGTGTAC GACACGATGT 1750

GCACTGAGTT TGACCAACTG AGTGTGTGTG CTGCCGCGCG CACACAAGGG 1800

GGGGAGAAAG TCGAATGA 1818



2) INFORMATION FOR SEQ ID NO: 711 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1779 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 
(B) STRAIN: MoPn

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 711 



CAGGGCTATG TCGTAGAAGC TTACGGAAAT TTATTGCGGG TGCATTTTGA 50 

TGGGCATGTG CGTCAAGGAG AAGTGGCCTA TGTCAGCGTG GATGATACTT 100

GGTTGAAAGC GGAAATTATA GAAGTTGTGG GAGATGAGGT TAAAGTCCAA 150 

GTTTTTGAGG AAACTCAAGG AATTTCTCGA GGCGCTTTGG TAACTTTTTC 200 

CGGGCATTTA TTAGAAGCGG AACTTGGGCC CGGTCTATTG CAAGGTATTT 250 

TTGACGGACT TCAGAATCGC TTAGAGGTAT TGGCAGATAC AAGCTTGTTC 300 

TTGAAAAGAG GGGAGTATGT TAATGCCATT TGTCGGGAAA CTGTATGGGC 350

TTATACGCAA AAGGCTTCTG TCGGGGATGT TCTATCTCGG GGAGATGTGC 400

TTGGTACAGT AAAGGAAGGG CGGTTTGATC ATAAAATCAT GGTTCCTTTC 450 

TCTTGTTTTG AGGAAGTGAC TATCACTTGG GTCATTTCTT CAGGAGATTA 500 

CACTGTTGAT ACCGTTATTG CTAAAGGACG TACTGCTTCA GGAGCCGAGC 550 

TTGAATTTAC AATGGTTCAG AAATGGCCCA TTAAACAGGC TTTTTTAGAA 600

GGGGAAAAGG TACCGTCTCA TGAAATTATG GATGTTGGGT TACGAGTATT 650 

AGATACTCAG ATCCCCGTCT TAAAGGGAGG AACTTTTTGT ACTCCAGGGC 700 

CTTTTGGTGC AGGAAAGACC GTTTTACAGC ACCATTTATC TAAGTATGCA 750 

GCTGTAGATA TCGTAGTTTT GTGTGCTTGT GGAGAGCGAG CTGGAGAGGT 800 

TGTAGAAATT CTTCAGGAGT TCCCGCATTT GACAGATCCT CATACGGGGC 850

AGTCTTTGAT GCATAGGACC TGTATTATTT GTAATACATC TTCCATGCCT 900 

GTAGCAGCTA GAGAGTCCTC CATTTATTTG GGTATTACTA TAGCAGAATA 950 

TTACCGTCAA ATGGGGTTGC ATGTTTTGTT ATTGGCTGAC TCGACATCTA 1000 

GATGGGCTCA AGCTTTAAGG GAAATTTCAG GGCGATTAGA AGAAATCCCT 1050 

GGAGAAGAAG CTTTCCCAGC CTATTTGGCG TCTCGAATAG CAGCTTTTTA 1100

TGAGCGAGGC GGGGCTGTGA AAATGAAAGA TGGATCGGAA GGCTCCTTGA 1150 

CTATCTGTGG AGCGGTTTCT CCCGCAGGAG GAAATTTTGA AGAGCCTGTT 1200 

ACACAAGCAA CTTTATCTGT TGTTGGGGCT TTCTGTGGGC TTTCTAAGGC 1250 

TAGAGCAGAT GCTAGACGGT ATCCTTCTAT TGATCCGATG ATTTCATGGT 1300 

CTAAGTACTT GGATTCTGTG GCGGAGATTT TAGAGAAAAA AGTTCCAGGA 1350

TGGGGAGATT CCGTTAAAAA AGCTTCTCGT TTCTTAGAAG AAGGAGCAGA 1400

AATTGGTAAG CGAATAGAAG TTGTTGGGGA AGAAGGGATT TCTATGGAAG 1450 

ATATAGAAAT CTTTTTGAAA TCAGAGTTGT ATGATTTCTG TTACTTACAG 1500 

CAAAACGCTT TCGATGCAGA GGACTGTTAT TGTCCTTTTG ATCGTCAAAT 1550 

AGAGCTTTTT TCTTTAATGA GTCATATTTT TAGCTCTAGA TTCTGTTTTG 1600

ATTGTCCAGA TAATGCTCGG AGTTTCTTTT TAGAGCTTCA AAGTAAAATT 1650 

AAAACGCTGA ATGGTCAAAA ATTCCTTTCT GAAGACTATC AGAAGGGGCT 1700 

AGAAGTGATC TATAAACTAT TAGAAAGCAA AATGGTGCAG ACGGCGTAGG 1750 

TATGCAAACA ATATATACAA GAATTACGG 1779 




2) INFORMATION FOR SEQ ID NO: 712 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 965 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 


(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis

(B) STRAIN: V583

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 712 

GTGCAAATTG GAAAAATTGT CAAAGTTTCA GGTCCTTTGA TTTTAGCTGA 50 

AAACATGTCA GATGCTAGTA TCCAAGACAT TTGTCATGTA GGAGATTTAG 100

GCGTTATCGG AGAGATTATT GAAATGCGAG GCGACGTCGC TTCGATTCAA 150 

GTATATGAAG AAACAACAGG CATTGGACCA GGAGAACCAG TTATTTCAAC 200 

AGGAGAACCA TTATCTGTTG AATTAGCCCC AGGTTTAATT GCCGAAATGT 250 

TTGATGGTAT TCAACGACCA TTGGATACAT TTCAAGAAGT AACCCACAGT 300 

AACTTTTTAG GCCGTGGCGT TAAAATTGAT GCGTTAGATC GTGAGAAAAA 350

ATGGACGTTT GAACCAACTG TGGCAGTTGG TGAAGAAGTG TCGGCAGGTG 400

ACATCGTCGG TGTGGTTCAA GAAACACCGA TTATTCAACA TAAAATTATG 450 

GTGCCTTTCG GCGTTTCAGG AACGATTGCC GAAATTAAAG CAGGTGACTT 500 

TGCCATTGAT GAAACAGTTT ACTCAGTGGA AACGGCTAAA GGAACGGAAA 550 

GTTTTAGCAT GATGCAAAAA TGGCCCGTTC GGCGGGGACG TCCCATTTTA 600

GAAAAACTAA GTCCCAAAGT ACCGATGGTG ACCGGACAAC GCGTAATTGA 650 

TACCTTTTTC CCAATTACGA AAGGCGGAGC GGCAGCAGTT CCAGGACCAT 700 

TTGGCGCTGG AAAAACAGTC GTTCAGCACC AAATTGCTAA GTGGGCCGAT 750 

GTCGACTTAG TCGTTTACGT TGGTTGTGGG GAACGCGGGA ATGAAATGAC 800 

AGATGTTTTA AATGAATTTC CAGAATTAAT TGACCCAACA ACTGGTGAGT 850

CTTTGATGAA TCGGACGATT TTAATTGCGA ATACGTCAAA TATGCCGGTA 900 

GCGGCACGGG AAGCCTCGAT TTATACAGGG ATTACCATTG CAGAATATTT 950 

CCGTGATATG GGTTA 965 






2) INFORMATION FOR SEQ ID NO: 713 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1737 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanosarcina barkeri 

(C) ACCESSION NUMBER: extracted from J04836 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 713 



GTGGAAGTAA AAGGTGAAAT TTATCGTGTG TCTGGGCCTG TCGTCACCGC 50 

CATCGGCTTG CAGGCAAAAA TGTATGACCT GGTCAAAGTC GGTAATGAAG 100 

GTTTAATGGG TGAAGTCATT CAGATATTAG GGCCCAAGAC CATCATCCAG 150

GTATATGAAG AGACCGCAGG TATCAAGCCA GGGGAACCCT GTGTATCTAC 200 

AGGGTCGTCT CTGTCCGTAG AACTTGGTCC GGGTCTTCTT TCCAGTATTT 250 

ATGACGGGGT TCAAAGGCCT CTGCACGTCC TGCTTGAAAA AATGGGTAGC 300 

TTCATCCAGA GAGGTGTCAG CGCAGATGGG CTTGATCATA AGAAACTCTG 350 

GGATTTCAAA CCCATTGTCA AGAAGGGCGA TTCCGTAAAA GGTGGAGACG 400

TAATTGGTGT TGTACAGGAA ACCGTGAATA TTGAACATAA GATC&TGGTG 450

CCTCCTGATA TCTCAGGTAC AATTTCCGAC ATAAAGAGCG GAAACTTTAC 500

GGTAGTAGAC ACAATCTGTA CTCTGACTGA TGGGACCGAA TTGCAGATGA 550

TGCAGAGGTG GCCTGTTCGA AGACCCAGAC CTGTGAAGGC AAAACTTACT 600

CCAACCAGGC CTCTGGTTAC AGGAATGAGA ATCCTTGATG GGCTTTTCCC 650

TGTGGCAAAA GGCGGAACAG CTGCAATCCC CGGACCTTTC GGATCGGGAA 700

AGACCGTAAC TCAGCAGTCG CTTGCAAAAT GGAGTGATAC CGAAATTGTG 750

GTCTACATCG GTTGTGGTGA GCGTGGAAAC GAAATGGCAG ATGTTCTGAG 800

CGAATTCCCT GAACTCGAAG ATCCGCAGAC CGGGCGCCCA CTTATGGAGC 850

GTACTGTTCT TATCGCTAAC ACTTCAAACA TGCCTGTGGC CGCAAGAGAA 900

GCATCTGTGT ATACCGGAAT CACCATTGCA GAATACTACC GTGACATGGG 950

ATTAGATGTA TCCCTTATGG CAGACTCCAC CTCAAGGTGG GCAGAAGCCA 1000

TGAGAGAAAT CTCTTCCCGT CTGGAAGAAA TGCCTGGTGA AGAAGGTTAC 1050

CCAGCATACC TGTCTGCAAG ACTGGCCGAA TTCTACGAGC GTGCCGGGGT 1100

TGCGGAGAGT CTTTGCGGCG AAACAGGTTC CATTACTGTT ATTGGAGCAG 1150

TATCTCCACC TGGCGGTGAC TTCTCAGAGC CTGTTACACA GAATACCCTG 1200

CGTATCGTAA AAGTGTTCTG GGCTCTCGAT GCCAAACTAT CTCAGAGGCG 1250

TCACTTCCCG GCCATCAACT GGCTGAACAG TTACAGTCTG TATAAGGACA 1300

GTCTTAATGA CTGGTTTGCA GATAATGTGG CTCCTGATTA TGTGCCTTTG 1350

AGGGAAAGAG CAATGGAAAT GCTCCAGACA GAATCTGAAC TGCAGGAAAT 1400

CGTGCAGCTT GTAGGTTCCG ATGCTCTGCC AGACGACCAG CAGCTTCTGC 1450

TTGAAATCAC CCGTATGCTT AGGGAAATTT TCCTGCAGCA GAATGCATTC 1500

CACCCAGTAG ATGCATACAG CCCGTTCGAT CAGCAGTACA AGATCCTTAA 1550

GGCAATCATG AAATGGGGAG ACGCTGCGAT GGATGCCTTG AAATCAGGTG 1600

TTCCCGTAAC TGAAATTATC AAGCTTGAAT CCAAAAATGT GCTTGCTAAG 1650

GTCAAGTACG AAGAGAAGTT TGATGAGTCT ATGAATGCTG TCCTGGCACA 1700

GATGGATAAA GAGTTTGCAT CCCTGAGAGG TAGGTAA 1737






2) INFORMATION FOR SEQ ID NO: 714 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1785 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanosarcina jannaschii 

(C) ACCESSION NUMBER: extracted from U67477 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714

ATGAAAAGAG AGGTTGAGAA TATGCCAGTT GTTGGTAAGA TTATTAAAAT 50

CGCAGGGCCT GTTGTAGTTG CAGAGGGAAT GAAAGGAGCT CAGATGTATG 100

AGGTCGTTAA AGTAGGAGAA GAGAAATTGA CTGGAGAAAT CATTCAGTTG 150

CACGATGATA AAGCAGTTAT TCAGGTTTAT GAAGAAACAT CTGGAATTAA 200

ACCAGGAGAG CCAGTTGTTG GTACTGGAGC TCCATTGTCT GTTGAATTAG 250

GGCCAGGGAT GTTAAGAGCT ATGTATGATG GTATTCAGAG GCCTTTAACA 300

GCAATTGAAG AGAAAACAGG TTCAATCTTT ATCCCAAGAG GAGTTGATGT 350

CCCTGCATTA CCAAGAGATA TAAAATGGGA ATTTAAACCA GTGGTAAATG 400

AAGGAGATTA TGTTGAAGAA GGAGACATAA TTGGAACTGT TGATGAAACT 450

CCTTCAATAG TTCATAAAAT CTTAGTTCCA ATTGGTGTTA AAGGAAAAAT 500

TGTTGAAATA AAAGAGGGTA AATTTACAGT TGAAGAGACA GTTGCAGTTG 550

TAGAAACAGA AAATGGAGAA AGGAAAGAAA TTACAATGAT GCAAAAATGG 600

CCAGTAAGAA AACCAAGACC ATATAAAGAG AAACTACCTC CAGAAATTCC 650

ATTAATTACA GGGCAAAGAG TTGAAGACAC TTTCTTTACA TTAGCAAAAG 700

GAGGAACAGC AGCAATTCCA GGTCCATTCG GTTCAGGAAA AACGGTTACT 750

CAGCATCAGT TGGCAAAGTG GTCTGACGCT GATGTCGTTG TTTATATCGG 800 

ATGTGGAGAA AGAGGAAACG AGATGACAGA GGTTATTGAA GAGTTCCCAC 850 

ACTTAGAAGA TATTAGAACT GGAAACAAAT TAATGGATAG AACTGTATTA 900 

ATAGCCAACA CATCAAACAT GCCTGTCGCT GCAAGGGAAG CATCTGTCTA 950

TACAGGAATT ACAATTGCAG AGTACTTCAG AGATATGGGT TATGGAGTTT 1000 

TATTAACAGC AGATTCAACA TCAAGATGGG CAGAGGCAAT GAGAGAAATT 1050 

TCAGGTAGAT TGGAAGAAAT GCCAGGGGAA GAAGGGTATC CAGCATACTT 1100 

AGCTTCAAGA TTGGCTCAGT TCTATGAAAG AGCTGGAAGA GTTATAACCT 1150 

TAGGGAAAGA TAACAGACAA GGATTCGTTT GTATCGTTGG AGCTGTTTCA 1200

CCACCAGGAG GGGACTTCTC AGAACCAGTT ACATCAAACA CACTAAGGAT 1250 

AGTTAAGGTA TTCTGGGCGT TAGATGCAAA CTTGGCAAGA AGAAGACACT 1300 

TCCCAGCTAT CAACTGGTTG CAGAGTTATT CATTATACAT TGATGATGTT 1350 

ACAGAGTGGT GGAACACAAA TACTGGTCCA GATTGGAGAC AATTAAGAGA 1400

TGAAGCAATG AGCTTATTAC AAAAAGAGGC AGAGTTGCAA GAGATTGTTC 1450

AGTTAGTTGG GCCTGATGCA TTGCCAGATA GGGAGAGAGT TATTTTAGAA 1500 

GTTGCAAGAA TGTTGAGGGA GGATTTCTTA CAGCAAGATG CGTTTGATGA 1550 

GGTAGATACC TACTGTCCTC CAATGAAACA GTACTTAATG TTAAAGATAA 1600 

TTATGACATT CTACCAAGAA GCATTGAAGG CAGTTGAAAG AGGAGTTGAA 1650 

CCAGCTAAGA TTTTAGGAGT TTCAGTTAAG CAAGATATTG CAAGAATGAA 1700

ATACATCCCA CACGATGAGT TTATAAATGT TAAATCAAAA GAAATAATGG 1750 

AGAAAATTAA GAATGAATTA GGTTCATTAA ACTAA 1785 






2) INFORMATION FOR SEQ ID NO: 715 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1354 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(B) STRAIN: W83 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715

TGAGCGAGGT GATCAAGGTG ATCGGCAAAA ATGCTTATGT GCAGGTTTTC 50 

GAAAGTACTC GCGGTATGCA CGTAGGAGAT GAGGCAGAGT TTACCGGCAG 100 

TATGCTTGAG GTAACGCTCG GCCCCGGTAT GCTTTCGAAG AACTACGACG 150 

GTCTGCAACA CGACTTGGAC AAGATGGACG GGATCTTCCT CAAACGAGGC 200

GATTATACTC CCGCTCTCGA TGACGACAAG CTGTGGGACT TCAAGCCTTT 250

GGCCAATGTG AACGACAATG TGATCGCAGG CTCATGGCTC GGAGAGGTGA 300 

CGGAAAATTT CCAACCGCAC AAGATCATGG TACCTTTCGT TTTCGAAGGC 350 

AATTACAAGG TGAAGAGTCT GGCCAAAGCC GGTTCGTACA AAGTGAACGA 400 

TGTGATCGCT GTGGTAACGG ATCAGGACGG GAAAGACCAC AATGTAACCA 450

TGGTGCAGAA ATGGCCGGTG AAACGTGCTA TCACTTGCTA TCGCGAGAAG 500 

CCGCGTCCTT TCAAACTGCT CGAAACGGGT ATCCGTATCA TCGACACTTT 550 

CAACCCCATC GTAGAGGGTG GTACGGGATT TATCCCCGGT CCTTTCGGTA 600 

CGGGAAAGAC GGTGCTCCAG CATGCTATCT CGAAGCAGGC GGAAGCCGAT 650 

ATCGTGATCA TTGCAGCCTG TGGCGAGCGT GCAAACGAGG TTGTGGAGAT 700

CTTTGCGGAA TTCCCCCACC TGAATGACCC CCACACGGGA CGTAAATTGA 750 

TGGAACGTAC CATTATTATT GCTAATACGT CGAATATGCC TGTGGCTTCG 800 

CGTGAGGCAT CCGTATATAC GGCCATGACG ATAGCCGAGT ACTATCGCTC 850 

CATGGGCCTT CGCGTGCTGA TGATGGCAGA CTCCACTTCG CGTTGGGCAC 900 

AGGCTCTGCG TGAGATGTCT AACCGTCTGG AAGAGCTTCC CGGACCGGAT 950

GCTTTCCCGA TGGACTTGTC AGCTATCGTA GCCAACTTCT ACGfc!TCGTGC 1000

AGGATACGTT TACCTGAACA ACGGTTCGGC CGGTTCGGTA ACGTTCATCG 1050

GTACGGTATC TCCCGCCGGT GGTAACCTCA AAGAGCCTGT GACGGAAAAC 1100

ACCAAGAAAG TGGCTCGCTG CTTCTATGCT TTGGAGCAGA ATCGTGCCGA 1150

CCGCAAACGT TATCCGGCTG TAAACCCCAT CGATAGTTAC TCGAAGTACA 1200

TCGAATATCC CGAATTCGAG AGCTATATAT CGAACCACAT CAGTTTACTC 1250

ATTATATTCA ACAGAAGTTG GCAGATATAT GGATCAAATC TTACAGCAGA 1300

CTGGACTACT AAGGTGAATG AGCTGAAGAT GCGCTTGCAT CAGGGTAAAG 1350

AAAT 1354






2) INFORMATION FOR SEQ ID NO: 716 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1788 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: Type 4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 716 



TTTGACTCAA GGGAAGATTA TAAAAGTATC GGGACCTCTA GTTATTGCAT 50

CAGGTATGCA GGAGGCTAAT ATTCAAGATA TTTGCCGTGT AGGTAAGCTA 100

GGGTTAATCG GTGAAATTAT TGAAATGAGA AGAGATCAGG CATCTATCCA 150

AGTCTATGAA GAAACATCTG GTCTTGGTCC GGGAGAACCT GTTGTTACAA 200

CTGGAGAACC TCTCTCGGTT GAATTAGGGC CAGGATTGAT TTCTCAAATG 250

TTTGATGGCA TACAACGCCC ATTAGATCGA TTTAAATTGG CTACTCATAA 300

TGATTTTCTA GTTCGTGGGG TAGAAGTTCC AAGTTTGGAT AGAGATATTA 350

AGTGGCATTT TGATTCCACT ATAGCAATTG GTCAAAAAGT GAGTACGGGT 400

GATATTCTTG GAACTGTCAA GGAAACCGAG GTAGTTAATC ATAAAATTAT 450

GGTTCCTTAT GGAGTATCTG GAGAAGTCGT TTCTATTGCA TCTGGCGATT 500

TTACAATTGA TGAAGTTGTA TATGAAATAA AAAAATTGGA CGGTAGTTTC 550

TATAAAGGAA CGCTTATGCA AAAATGGCCT GTCCGCAAGG CGCGTCCTGT 600

TTCTAAACGT TTAATTCCAG AAGAACCATT AATCACAGGT CAACGAGTTA 650

TTGATGCATT CTTTCCAGTA ACCAAAGGGG GAGCTGCAGC AGTTCCTGGA 700

CCGTTTGGAG CAGGAAAGAC AGTTGTACAA CACCAAGTAG CTAAATTTGC 750

CAATGTTGAT ATTGTTATTT ATGTCGGTTG TGGAGAACGT GGAAATGAAA 800

TGACGGATGT ACTGAATGAG TTTCCTGAGT TGATTGACCC TAATACCGGA 850

CAATCAATTA TGCAACGGAC AGTTCTGATT GCTAATACTT CAAATATGCC 900

TGTTGCTGCT CGTGAGGCTT CAATTTATAC AGGAATTACC ATGGCTGAGT 950

ATTTTCGTGA TATGGGCTAC TCTGTCGCCA TTATGGCTGA TTCAACTTCA 1000

CGTTGGGCAG AAGCGCTACG TGAAATGTCA GGACGTCTAG AAGAAATGCC 1050

TGGTGATGAG GGTTATCCTG CTTATCTGGG AAGTCGTATC GCTGAATATT 1100

ATGAAAGAGC AGGACGTTCT CAGGTTCTAG GGCTTCCAGA ACGTGAAGGA 1150

ACGATTACTG CTATTGGAGC TGTATCGCCA CCTGGTGGAG ATATTTCAGA 1200

ACCAGTTACT CAAAACACTT TACGGATTGT GAAAGTTTTT TGGGGGCTTG 1250

ATGCTCCGTT GGCACAGCGA CGTCATTTTC CTGCAATTAA CTGGCTTACA 1300

TCTTATTCAC TATATAAAGA CAGTGTGGGC ACTTATATAG ATGGTAAAGA 1350

GAAGACAGAT TGGAATAGTA AAATAACTCG TGCGATGAAC TACTTACAAC 1400

GGGAATCTAG TTTAGAGGAA ATTGTTCGTC TTGTTGGAAT TGATTCTCTG 1450

TCTGATAATG AACGACTAAC GATGGAAATT GCTAAACAAA TTCGAGAAGA 1500

TTATTTGCAA CAGAACGCTT TTGATTCGGT AGATACATTC ACTTCGTTTG 1550

CAAAACAAGA AGCAATGCTA AGTAATATTC TCACTTTTGC TGATCAGGCA 1600

AATCATGCTT TAGAGTTGGG TTCTTACTTT ACAGAGATTA TGCAAGGTAC 1650

CGTGGCAGTT CGAGACCGTA TGGCGAGAAG TAAATATGTT TCAGAAGATA 1700

GATTAGATGA AATCAAAATT ATATCAAATG AGATTACACA TCAAATTCAT 1750

TTGATATTAG AAACAGGAGG TCTATAAATG AGTGTTAT 1788



2) INFORMATION FOR SEQ ID NO: 717 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear






(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia mallei 
(B) STRAIN: GB8

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717 

CGATCCTGGT GTGCTCGGCC GCTGACGGCC CGATGCCGCA AACGCGTGAG 50 

CACATCCTGC TGGCGCGTCA GGTCGGTGTG CCGTACATCA TCGTGTTCCT 100

GAACAAGTGC GACATGGTGG ACGACGCGGA GCTGCTCGAG CTGGTCGAAA 150

TGGAAGTGCG CGAACTGCTG TCGAAGTACG ACTTCCCGGG CGACGACACG 200

CCGATCATCA AGGGTTCGGC GAAGCTGGCG CTGGAAGGCG ACAAGGGCGA 250

GCTGGGCGAA GTGGCGATCA TGAACCTGGC CGACGCGCTG GACACGTACA 300

TCCCGACGCC GGAGCGTGCG GTCGACGGCG CGTTCCTGAT GCCGGTGGAA 350

GACGTGTTCT CGATCTCGGG CCGTGGTACG GTGGTGACGG GTCGTGTCGA 400

GCGCGGCGTG ATCAAGGTTG GCGAGGAAAT CGAAATCGTC GGTATCAAGG 450

CGACGGCGAA GACGACCTGC ACGGGCGTGG AAATGTTCCG CAAGCTGCTG 500 

GACCAGGGTC AGGCGGGCGA CAACGTCGGT ATCCTGCTGC GCGGCACAAA 550 

GCGTGAAGAC GTGGAGCGCG GCCAGGTTCT GGCGAAGCCG GGTTCGATCA 600

CGCCGCACAC GCACTTCACG GCAGAAGTGT ACGTGCTGAG CAAGGACGAA 650 

GGCGGCCGCC ACACGCCGTT CTTCAACAAC TACCGTCCGC AGTTCTACTT 700 

CCGTACGACG GACGTGACGG GCTCGATCGA GCTGCCGAAG GACAAGGAAA 750 

TGGTGATGCC GGGCGACAAC GTGTCGATCA CGGTGAAGCT GATCGCGCCG 800 

ATCGCGATGG AAGAAGGTCT GCG 823



2) INFORMATION FOR SEQ ID NO: 718 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Burkholderia pseudomallei

(B) STRAIN: 1026B 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718 

GCGATCCTGG TGTGCTCGGC CGCTGACGGC CCGATGCCGC AAACGCGTGA 50

GCACATCCTG CTGGCGCGTC AGGTCGGTGT GCCGTACATC ATCGTGTTCC 100

TGAACAAGTG CGACATGGTG GACGACGCGG AGCTGCTCGA GCTGGTCGAA 150

ATGGAAGTGC GCGAACTGCT GTCGAAGTAC GACTTCCCGG GCGACGACAC 200

GCCGATCATC AAGGGTTCGG CGAAGCTGGC GCTGGAAGGC GACAAGGGCG 250

AGCTGGGCGA AGTGGCGATC ATGAACCTGG CCGACGCGCT GGACACGTAC 300

ATCCCGACGC CGGAGCGTGC GGTCGATGGC GCGTTCCTGA TGCCGGTGGA 350

AGACGTGTTC TCGATCTCGG GCCGTGGTAC GGTGGTGACG GGTCGTGTCG 400

AGCGCGGCGT GATCAAGGTT GGCGAGGAAA TCGAAATCGT CGGTATCAAG 450

GCGACGGCGA AGACGACCTG CACGGGCGTG GAAATGTTCC GCAAGCTGCT 500

GGATCAGGGT CAGGCGGGCG ACAACGTCGG TATCCTGCTG CGCGGCACGA 550

AGCGTGAAGA CGTGGAGCGC GGCCAGGTTC TGGCGAAGCC GGGTTCGATC 600

ACGCCGCACA CGCACTTCAC GGCTGAAGTG TACGTGCTGA GCAAGGACGA 650

AGGCGGCCGC CACACGCCGT TCTTCAACAA CTACCGTCCG CAGTTCTACT 700

TCCGTACGAC GGACGTGACG GGCTCGATCG AGCTGCCGAA GGACAAGGAA 750

ATGGTGATGC CGGGCGACAA CGTGTCGATC ACGGTGAAGC TGATCGCGCC 800

GATCGCGATG GAAGAAGGTC TGCG 824



2) INFORMATION FOR SEQ ID NO: 719

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Clostridium beijerinckii (deposited as Clostridium butyricum)

(B) STRAIN: ATCC 8260

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 719

TGTATCAGCA GCAGATGGTC CAATGCCACA AACAAGAGAA CATATACTAC 50 

TAGGATCAAG AGTTGGTATC CAATATATCG TAGTATTCTT AAATAAAGCA 100 

GATATGGTAG ACGATCCAGA ATTATTAGAA TTAGTAGAAA TGGAAGTAAG 150 

AGAATTATTA AGCGAATATG ACTTCCCAGG AGACGATATT CCAGTAATAA 200

CAGGATCAGC ATTAAAAGCA TTAGAAAATC CAACAGATGA AGAAGCAATT 250

AAGCCAATCA TGGATTTAAT GGAAGCAGTA GATAGCTATA TCCCAACTCC 300

AGAAAGAGCA ACAGATAAGC CATTCTTAAT GCCAATCGAA GATGTATTCA 350 

CAATTACAGG AAGAGGAACA GTTGCAACAG GAAGAGTTGA AGCTGGAGTA 400 

CTTCATGTAG GAGATGAAGT AGAAATCGTT GGATTAACAG AAGAAAAGAA 450

GAAAGTTGTA GTAACTGGAA TCGAAATGTT CAGAAAGTTA TTGGATGAAG 500 

CACAAGCTGG AGATAACATC GGAGCATTAT TAAGAGGAGT TCAAAGAACT 550 

GATATTGAAA GAGGTCAAGT TTTATCAAAA CCAAATTCAG TACACCCTCA 600 

CACTAAATTT GTAGGTCAAG TATACGTACT TAAAAAAGAA GAAGGTGGAA 650 

GACATACTCC ATTCTTTGAT GGATACAGAC CACAATTCTA TTTCAGAACA 700

ACAGACGTTA CAGGRTCAAT CAAGTTACCA GATGGAATGG AAATGGTAAT 750 

GCCTGGAGAT CACATTGATA TGAATGTTGA ATTAATCACT CCAATCGCAA 800 




2) INFORMATION FOR SEQ ID NO: 720 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 799 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium innocuum 

(B) STRAIN: ATCC 14501 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 720

GGTGCTATCC TGGTTGTTGC TGCATCTGAT GGTCCTATGC CTCAGACTCG 50 

TGAGCACATC CTGCTTGCTC GTCAGGTAGG TGTTCCTTAC ATCGTTGTAT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAACTGAT CGACCTTGTT 150 

GAAATGGAAG TACGTGAGCT GTTAAGCGAG TACGGATTCG ACGGAGATAA 200

CGCTCCGGTT ATCCGTGGTT CTGCACTGAA GGCTCTGGAA GGTGACGACA 250 

AATACGTTGG CGCTATCAAA GAACTGATGG ATGCAGTTGA TGAATTCATC 300 

CCAGATCCAA CTCGTGAAAC TGACAAACCA TTCCTGATGT CTGTAGAAGA 350 

CGTTATGACA ATCACAGGAC GTGGTACAGT TGCTACAGGA CGTGTTGAGC 400

GTGGGGTAGT AAAACTGGGA GAAGAAGTTG AAATCGTTGG TATCAAGGAT 450

ACTCAGAAAA CTGTTGTTAC CGGACTGGAA ATGTTCCGTA AGCAGCTGGA 500 

CTTCGCAGAA TCCGGAGACA ACATCGGTGC TCTGCTGCGT GGTATCAACC 550 

GTGACCAGAT TCAGCGTGGA CAGGTTCTTG CTAAACCAGG ATCCGTACAT 600 

CCACACACAA AGTTCAAGGC TCAGGTTTAT GTATTAACAA AAGAAGAAGG 650 

TGGACGTCAC ACTCCATTCG TTTCTAACTA CCGTCCTCAG TTCTACTTCC 700

GTACAACTGA CGTAACTGGT GTTATTACAT TACCGGAAGG AACTGAAATG 750 

GTTATGCCTG GTGACAACGT TGAAATGAAC GTTGAGCTGA TTGCTCCAA 799 






2) INFORMATION FOR SEQ ID NO: 721 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 789 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 721

TTGTATCAGC AGCAGATGGT CCAATGCCAC AAACAAGAGA ACACATCCTA 50 

TTAGCATCAA GAGTAGGAGT TAACCACATA GTAGTATTCT TAAACAAAGC 100 

AGACCAAGTA GATGATCCAG AATTACTAGA ATTAGTAGAA ATGGAAGTAA 150 

GAGAATTATT AAGCGAATAC GGATTTGACG GAGACGAATG TCCAGTAGTA 200

GTAGGATCAG CATTAAAAGC AATCGAAGAA GGGGATGACC AATGCATCCT 250 

AGACTTAATG AAAGCTGTAG ATGAATATAT CCCAACTCCA GAAAGAGCAA 300 

CAGATCAACC ATTCTTAATG CCTGTAGAAG ATGTATTTAC AATTACAGGA 350 

AGAGGAACAG TTGCAACAGG AAGAGTTGAA AGAGGAGTAC TACACGTAGG 400

AGATGAAGTA CAAATCGTAG GAATGAAAGA AGAAATCGGA AAGACAACAA 450

TCACAGGAGT AGAAATGTTC AGAAAGATGT TAGATGAAGC AATGGCTGGA 500 

GATAACATCG GAGCATTATT AAGAGGAGTA CAAAGAGACG AAATCGAAAG 550 

AGGTCAAGTA CTAGCAAAAC CAGGTTCAGT AACACCTCAC AAAAAATTCG 600 

TAGGTCAAGT TTACGTATTA AAGAAAGAAG AAGGTGGAAG ACACACTCCA 650

TTCTTTAACG GATACAGACC ACAATTCTAC TTCAGAACAA CAGACGTAAC 700

AGGATCAATC GCTTTACCAG AAGGAGTAGA AATGGTAATG CCAGGAGACC 750

ATATAGACAT GAACGTAGAA TTAATCACAC CAGTAGCAA 789



2) INFORMATION FOR SEQ ID NO: 722 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12464 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722 



GTTCAGCAGC AGACGGTCCA ATGCCACAAA CAAGAGAACA TATACTACTA 50

GCATCAAGAG TTGGTGTTGA CTATATCGTA GTATTCTTAA ACAAGGCAGA 100

TATGGTAGAT GACGAAGAAT TATTAGAATT AGTAGAAATG GAAGTTAGAG 150

AATTATTATC AGAATACAAC TTCCCAGGAG ATGATATTCC AGTAATCAAG 200

GGATCAGCTT TAGTAGCATT AGAAAACCCA ACAGATGAAA AATCAATCGC 250

TCCAATCTTA GAATTAATGG AAGCAGTAGA TAGCTACATT CCAACACCAG 300

AAAGAGCAAC AGATAAGCCA TTCTTAATGC CAGTAGAAGA TGTATTCACA 350

ATAACTGGTA GAGGAACAGT TGCAACAGGA AGAGTTGAAA GAGGAGTTCT 400

TCATGTAGGA GACGAAGTAG AAATCGTTGG ATTATCAGAA GAAAGCAGAA 450

AAGTAGTAGT AACAGGAATA GAAATGTTCA GAAAGTTACT AGACGAAGCA 500

CAAGCTGGAG ATAATGTTGG AGTACTTTTA AGAGGTGTTC AAAGAACAGA 550

TATCGAAAGA GGTCAAGTAT TAGCAAAGAC TGGATCAGTT AAGCCACACA 600

GCAAGTTCGT AGGTCAAGTA TACGTACTTA AGAAAGAAGA AGGTGGAAGA 650

CATACTCCAT TCTTCGATGG ATACAGACCA CAATTCTACT TCAGAACAAC 700

AGACGTTACT GGATCAATCA AATTACCAGA CGGAATGGAA ATGGTTATGC 750

CAGGAGACCA CATTGATATG AACGTTGAAT TAATCACTCA AGTAGCAA 798



2) INFORMATION FOR SEQ ID NO: 723 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tertium 

(B) STRAIN: ATCC 14573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 723 



GTTCAGCAGC AGATGGTCCA ATGCCTCAAA CAAGAGAACA CATACTACTA 50 

GCTTCAAGAG TTGGTGTTGA CTACATAGTT GTTTTCTTAA ACAAGGCAGA 100 

TATGGTAGAT GACGAAGAAT TATTAGAATT AGTTGAAATG GAAGTAAGAG 150 

AATTATTATC AGAATACAAC TTCCCAGGAG ATGATATTCC AGTAATAAAG 200 

GGTTCAGCTT TACAAGCATT AGAAAACCCA ACAGATGAAA ^AGCAATCGC 250

TCCAATCCTT GAGTTAATGG AAGCTGTAGA TAGCTACATT CCAACTCCAG 300

AAAGAGCAAC AGATAAGCCA TTCTTAATGC CAGTAGAAGA TGTATTCACA 350

ATCACTGGTA GAGGAACAGT TGCTACAGGA AGAGTTGAAA GAGGAGTTCT 400

TCACGTAGGA GACGAAGTAG AAATCGTTGG ATTATCAGAA GACAGCAGAA 450

AAGTAGTAGT AACAGGAATA GAAATGTTCA GAAAGTTACT AGACGAAGCG 500

CAAGCTGGAG ACAACGTAGG AGTTCTTTTA AGAGGAGTTC AAAGAACTGA 550

CATCGAAAGA GGTCAAGTTT TAGCAAAAGT TGGATCAGTT AAGCCACACA 600

AGAAATTTGT AGGTCAAGTA TACGTACTTA AAAAAGAAGA AGGTGGAAGA 650

CATACTCCAT TCTTCGATGG ATACAGACCA CAATTCTACT TCAGAACAAC 700

AGAYGTTACT GGTTCAATCA AGTTACCAGA TGGAATGGAA ATGGTTATGC 750

CAGGAGACCA CATTGATATG AACGTTGAAT TAATCACTCA AGTAGCTAT 799



2) INFORMATION FOR SEQ ID NO: 724 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tetani 

(B) STRAIN: ATCC 19406 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 724 

TAGTAAGTGC AGCAGATGGT CCAATGCCAC AAACAAGAGA ACACATACTA 50 

TTAGCATCCA GAGTTGGAGT TGAGCACATA GTAGTATTCT TAAATAAAGC 100 

AGACCAAGTA GATGACGCAG AGTTAATCGA ATTAGTAGAA ATGGAAGTAA 150 

GGGAATTAAT GAACGAATAC GGATTCCCAG GAGATGACGC ACCAGTAGTA 200

GTAGGATCCG CATTAAAAGC ATTAGAAAAT CCAGAAGATG ATGCAGCAAC 250

ACAATGCATA ATGGACTTAA TGGCAGCAGT AGATGAATAT ATACCAACAC 300

CAGAAAGAGC AACAGATAAG CCATTCTTAA TGCCAGTAGA AGATATCTTC 350

ACAATCACAG GAAGAGGAAC AGTTGCAACA GGAAGAGTAG AAAGAGGAAT 400

TCTAAAAGTA GGAGACGAAA TAGAAATCGT AGGATTAAGT GATGAAAGCA 450

AGAAATCAGT AATCACAGGA ATAGAAATGT TCAGAAAACT ATTAGATGAA 500 

GCACAAGCAG GAGATAACAT CGGAGCATTA TTAAGAGGTG TTCAAAGAGA 550 

TGAAATCCAA AGAGGTCAAG TATTAGCAGC AACAGGATCA GTAAAACCAC 600 

ATAAGAGTTT TACAGGTCAA GTATATGTAT TAAAGAAAGA AGAAGGAGGA 650 

AGACACACTC CATTCTTTAA CGGATACAGA CCACAATTCT ACTTTAGAAC 700 

AACAGACGTA ACAGGTTCAA TCGCACTACC AGAAGGAGTA GAAATGGTAA 750 

TGCCAGGAGA CCACATAGAC ATGAAGGTAG AATTAATAAC AAGAGTAGCA 800 

A 801 



2) INFORMATION FOR SEQ ID NO: 725 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 bases 

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 725

TATGCCTCAA ACTCGTGAAC ACATCTTGTT ATCTCGTAAC GTTGGTGTTC 50

CTTACATCGT TGTATTCTTA AACAAAATGG ATATGGTTGA TGATGAAGAA 100

TTACTAGAAT TAGTTGAAAT GGAAGTTCGT GACTTATTGT CAGAATACGA 150

CTTCCCAGGC GACGACACTC CAGTTATCGC TGGTTCAGCT TTGAAAGCTT 200

TAGAAGGCGA TGCTTCATAC GAAGAAAAAA TCTTAGAATT AATGGCTGCT 250

GTTGATGAAT ATATCCCAAC ACCAGTTCGT GATACTGACA AACCATTCAT 300

GATGCCAGTC GAAGATGTAT TCTCAATCAC TGGTCGTGGA ACTGTTGCAA 350

CTGGTCGTGT TGAACGTGGA CAAGTTCGCG TTGGTGACGA AGTTGAAATC 400

GTTGGTATTG CTGAAGCAAC TGCTAAAACA ACTGTTACAG GTGTTGAAAT 450

GTTCCGTAAA TTGTTAGATT ACGCTGAAGC AGGCGATAAC ATTGGTGCAT 500

TGTTACGTGG TGTTGCACGT GAAGACATCC AACGTGGACA AGTATTGGCT 550

AAACCAGCTT CAATCACTCC ACATACAAAA TTCTCTGCAG AAGTTTACGT 600

TTTAACTAAA GAAGAAGGCG GACGTCATAC TCC 633



2) INFORMATION FOR SEQ ID NO: 726 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus sulfureus 

(B) STRAIN: ATCC 49903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 726 

CACGTGAACA CATCTTGTTA TCTCGTAACG TAGGTGTTCC TTACATCGTT 50 

GTATTCTTAA ACAAAATGGA TATGGTTGAT GACGAAGAAT TATTAGAATT 100 

AGTAGAAATG GAAGTTCGTG ACTTATTATC AGAATACGAT TTCCCAGGCG 150 

ATGACACTCC AGTTGTTGCA GGTTCTGCTT TGAAAGCTTT AGAAGGCGAC 200 

GCTTCTTACG AAGAAAAAAT CATGGAATTA ATGGCTGCAG TTGACGAGTA 250 

CATCCCAACT CCAACTCGTG ACACTGACAA ACCATTCATG ATGCCAGTTG 300 

AGGATGTATT CTCAATCACT GGACGTGGTA CTGTTGCTAC AGGTCGTGTT 350 

GAACGTGGAC AAGTTCGCGT TGGTGACGTT GTAGATATCG TTGGTATCGC 400 

TGACGAAACT GCTCAAACAA CTGTAACAGG TGTTGAAATG TTCCGTAAAT 450 

TATTAGACTA CGCTGAAGCA GGCGATAACA TCGGTGCTTT ATTACGTGGT 500 

GTTGCTCGTG AAGACATCCA ACGTGGACAA GTTTTAGCTA AACCAGCTTC 550 

AATCACTCCA CATACAAAAT TCTCTGCTGA AGTATACGTA TTAAGCAAAG 600 

AAGAAGGTGG ACGTCATACT CCA 623 



2) INFORMATION FOR SEQ ID NO: 727 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 646 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus garvieae 

(B) STRAIN: ATCC 49156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 727 

CGGTCCTATG CCTCAAACTC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 

GCGTACCATA CATCGTTGTT TTCTTGAACA AAATGGATAT GGTTGATGAC 100 

GAAGAATTGC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTGTCTGA 150 

ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCTGGT TCTGCTTTGA 200 

AAGCTCTTGA AGGAGATCCT TCATACGAAG AAAAAATCAT GGAATTGATG 250 

GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGATA CTGACAAACC 300

ATTCATGATG CCAGTCGAAG ACGTATTCTC AATCACTGGA CGTGGTACTG 350 

TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGATGAAGTA 400

GAAATCGTTG GTATTGCTGA CGAAACTGCT AAAACAACTG TAACAGGTGT 450

TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

GTGCATTGCT ACGTGGGGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550 

TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTCA AAGCTGAAGT 600 

TTATGTTTTG ACAAAAGAAG AAGGTGGACG TCACACTCCA TTCTTC 646



2) INFORMATION FOR SEQ ID NO: 728 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma pirum 

(B) STRAIN: ATCC 25960D 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 728 

GAGCAATTTT AGTTGTTTCT GCAACTGATG GTCCAATGCC TCAAACTCGT 50 

GAACATATCT TATTAGCACG CCAAGTTGGT GTTCCTAAAA TGGTTGTTTT 100 

CTTAAACAAA TGTGATGTTG CTTCTGATCC AGAAATGCAA GAATTAGTTG 150 

CTGAAGAAGT AAAAGACTTA TTAAAATCTT ATGGTTTTGA TGGTGACAAT 200 

ACTCCAATTA TTCGTGGTTC AGCATTAGAA GCATTAAATG GTAAACCTGA 250 

ATGAGAAGAA AAAATTAAAG AATTAATGAA GGCAGTGGAT GACACTATTC 300 

CTGATCCAGT TCGTGATACT GAAAAGCCAT TCTTGTTACC AATTGAAGAC 350 

GTAATGACAA TTACAGGTCG TGGTACTGTT GTTACAGGTC GTGTAGAACG 400

TGGTACTCTA AAATTAAATG ATGAAGTTGA AATTGTTGGT TTAGGTGAAA 450 

CATTTAAATC TGTTGTAACA GGTATTGAAA TGTTCCGTAA AGAATTAGAT 500 

GAAGCTCGTG CTGGTGACAA TGCTGGTATT TTACTTCGTG GTGTTGACCG 550 

TGGTCAAGTA CAACGTGGTC AAGTTCTTGC TAAACCAGGT TCTATTACTC 600 

CTCATACTAA ATTTAAAGCT GAAATTTATG CTTTGAAAAA AGAAGAAGGT 650 

GGTCGTCATA CTGCTTTCTT AAACGGTTAT CGTCCTCAAT TCTATTTCAG 700 

AACAACTGAT GTTACAGGTT CTATTAAATT AAAAGATGGA ACTGAAATGG 750 

TTATGCCTGG TGACAATACT GAAATCACTG TAGAATTAAT TTCACCAATT 800 

GCTTGTAAAA AGGAAGTAAG TTT 823 


2) INFORMATION FOR SEQ ID NO: 729



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma salivarium 

(B) STRAIN: ATCC 23064

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 729 



GGAGCAATCT TAGTTGTTGC TGCAACTGAT GGTGCGATGC CTCAAACTCG 50

TGAACACGTT TTACTTGCAA AACAAGTTGG TGTTCCTAAA ATCGTTGTTT 100

TCTTAAACAA AATCGATATG TTCAAGCCAG AAGAAAGAGC CGAAATGGTT 150

GAAATGGTTG AAATGGACAT TCGTGACTTA TTAAACAAAT ACGACTTTGA 200

TGGTGACAAT ACTCCTATTA TTGCTGGTTC AGCATTAAAA GCATTACAAG 250

GTGATCCAGA ATATGAAAAG AATATTCTAG AACTTATGGA TGCAGTAGAC 300

ACATACATTG ATGAACCTAC TCGTGATAAA GATAAACCAT TCTTAATGGC 350

TGTAGAAGAC GTATTTACAA TTACTGGTCG TGGAACTGTT GCTACTGGTA 400

GAGTAGAACG TGGTACATTA CATCTAAATG ATGAAGTTGA AATCGTTGGT 450

CTACGTCCAA CAATTAAAAC TGTTGTTACT GGAATTGAAA TGTTCCGTAA 500

AAATTTAAAA GAAGCTCAAG CTGGAGATAA TGCAGGACTT TTACTACGTG 550

GAATTGATAG AGATCAAGTA GAACGTGGAC AAGTTTTGGC CAAACCAAAA 600

AGTATTATTC CTCACACAGA ATTTGAAGCT GCTGTGTATG TTCTAAAAGC 650

TGAAGAAGGT GGACGTCACA CTCCATTCTT TGAACACTAT AAACCACAAT 700

TTTACTTTCG TACAACCGAC GTTACTGGTG GAATTAAATT CAAACCTGGA 750

CGTGAAATGG TTATGCCTGG CGAAAATGTT GAATTTACAG TTACTTTAAT 800

TGCTCCTATT GCAGTTGAAG AAGGAA 826



2) INFORMATION FOR SEQ ID NO: 730 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 730 



TGGTATGTTC CGCAGCYGAY GGYCCTATGC CTCAAACTCG CGAACACATC 50

CTGYTGGCTC GCCAAGTAGG YGTACCTTAC ATCATCGTRT TCATGAACAA 100

ATGCGACATG GTTGACGATG CCGAGCTGTT GGAACTGGTT GAAATGGAAA 150

TCCGYGACCT GYTGTCMAGC TACGACTTCC CMGGCGACGA CTGCCCAATC 200

GTACAAGGTT CTGCACTGAA AGCTTTGGAA GGYGAYGCTG GTTACGAAGA 250

GAAAATCTTC GAAYTGGCTG CTGCTTTGGA CAGCTACATC CCAACTCCTG 300

AGCGTGCTGT GGACAAACCK TTCYTGYTGC CTATCGAAGA CGTRTTbTCT 350

ATCTCTGGYC GYGGTACMGT AGTAACYGGY CGTGTAGAGC GCGGTRTCAT 400

CCACGTTGGT GACGAGATYG AAATCGTAGG TCTGAAAGAA ACCCAAAAAA 450

CCACTTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGTCAA 500

GCWGGTGACA ACGTAGGCGT ATTGYTGCGT GGTACCAARC GTGAAGACGT 550

AGAGCGTGGT CAAGTATTGG CYAAACCAGG YACCATYACT CCSCACACCA 600

AATTCAAAGC AGAAGTATAC GTACTGAGCA AAGAAGAGGG TGGTCGYCAC 650

ACTCCATTCT TCGCYAACTA CCGYCCMCAA TTCTACTTCC GTACYACTGA 700

CGTAACYGGT GCAGTTACTT TGGAAGAAGG TGTAGAAATG GTAATGCCAG 750

GTGAGAACGT AACCATTACY GTAGAACTGA TTGCGCCTAT CGCTAGGAAG 800

AAGGTCTGCG 810



2) INFORMATION FOR SEQ ID NO: 731



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis
serotype Enteritidis

(B) STRAIN: ATCC 13076



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731 



CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC GTGAGCACAT 50

CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG TTCCTGAACA 100

AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT TGAAATGGAA 150

GTTCGYGAAC TGCTGTCTCA GTACGACTTC CCGGGCGACG ACACTCCGAT 200

CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA GAGTGGGAAG 250

CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT CCCGGAACCA 300

GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG ACGTATTCTC 350

CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA CGCGGTATCA 400

TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA GACTCAGAAG 450

TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG ACGAAGGCCG 500

TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA CGTGAAGAAA 550

TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA GCCGCACACC 600

AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG GCGGCCGTCA 650

CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC CGTACTACTG 700

ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT GGTAATGCCG 750

GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA TCGCAATGGA 800

CGACGGTCTG CGT 813






2) INFORMATION FOR SEQ ID NO: 732 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis
serotype Gallinarum

(B) STRAIN: ATCC 9184

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 732

CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC GTGAGCACAT 50

CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG TTCCTGAACA 100

AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT TGAAATGGAA 150

GTTCGYGAAC TGCTGTCTCA GTACGACTTC CCGGGCGACG ACACTCCGAT 200

CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA GAGTGGGAAG 250

CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT CCCGGAACCA 300

GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG ACGTATTCTC 350

CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA CGCGGTATCA 400

TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA GACTCAGAAG 450

TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG ACGAAGGCCG 500

TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA CGTGAAGAAA 550

TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA GCCGCACACC 600

AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG GCGGCCGTCA 650

CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC CGTACTACTG 700

ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT GGTAATGCCG 750

GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA TCGCAATGGA 800

CGACGGTCTG CG 812



2) INFORMATION FOR SEQ ID NO: 733 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Paratyphi B 

(B) STRAIN: ATCC 8759 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 733 

TGGTTGTTGC TGCGACTGAC GGYCCGATGC CGCAGACCCG TGAGCACATC 50 

CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA 100 

ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG 150 

TTCGYGAACT GCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC 200

GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 250

GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATT CCGGAACCAG 300 

AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 350 

ATCTCTGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT 400

CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACTCAGAAGT 450 

CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 500 

GCYGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 550 

CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG CCGCACACCA 600 

AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT 650 

ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA 700 

CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 750 

GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT CGCAATGGAC 800

GACGGTCTGC GTTT 814



2) INFORMATION FOR SEQ ID NO: 734 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 828 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Virchow 

(B) STRAIN: ATCC 51955 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 734 



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 200

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATT 300 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 400

GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 650

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCAATGGAC GACGGTCTGC GTTTCGCA 828 




2) INFORMATION FOR SEQ ID NO: 735 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia grimesii 

(B) STRAIN: ATCC 14460

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 735 

GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCAATGC CACAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTTGG CGTTCCTTTC ATCATCGTAT 100

TCATGAACAA ATGCGACATG GTTGATGATG AAGAGCTGCT GGAACPGGTA 150

GAAATGGAAG TTCGTGAACT TCTGTCTGCT TATGACTTCC CTGGTGATGA 200

CCTGCCAGTT GTTCGTGGTT CAGCGCTGAA AGCACTGGAA GGCGAAGCTG 250 

AGTGGGAAGC TAAAATCATC GAACTGGCTG GCTACCTGGA TTCTTACATC 300 

CCAGAACCAG AGCGTGCTAT CGACAAGCCG TTCCTGCTGC CAATCGAAGA 350

CGTATTCTCC ATCTCCGGYC GTGGTACYGT AGTTACCGGT CGTGTAGAGC 400 

GCGGTATCGT TAAAGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAC 450 

ACCGTTAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGACAT CGAACGTGGT CAGGTACTGG CTAAACCAGG TTCAATCAAG 600

CCACACACCA AATTCGACTC AGAAGTTTAC ATCCTGAGCA AAGAAGAAGG 650 

TGGTCGTCAC ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 750 

GTAATGCCAG GCGATAACGT GAACATGGTT GTAACCCTGA TTCACCCAAT 800 

CGCGATGGAC GACGGTCTGC GTTTC 825



2) INFORMATION FOR SEQ ID NO: 736 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 736 

TATTAGTTTG TTCAGCAACA GATGGACCAA TGCCACAAAC AAGAGAGCAT 50

ATACTATTAT CAAGACAAGT TGGAGTACCA TATATAGTAG TATTCTTAAA 100 

CAAATGTGAC ATGGTAGATG ATGAAGAGTT ATTAGAGTTA GTAGAGATGG 150 

AAGTAAGAGA TTTATTAACA GAATATGATT TCCCAGGAGA TGACACTCCA 200 

ATAGTAAGAG GTTCAGCATT AATGGCATTA GAAGATCCAA AGAGYGAGTG 250 

GGGAGATAAG ATAGTAGAAT TATTCGAGCA AATAGATGAG TATATACCAG 300

CTCCAGAGAG AGATACAGAT AAACCATTCT TAATGCCAGT AGAGGACGTA 350 

TTCTCAATCA CAGGAAGAGG AACAGTTGCA ACAGGAAGAG TGGAAAGAGG 400

AGTACTAAAA GTACAAGACG AAGTAGAKTT AGTAGGATTA ACAGAAGCAC 450

CAAGAAAAGT AGTAGTAACA GGAGTAGAGA TGTTCAGAAA ATTATTAGAC 500 

CAAGCACAAG CAGGGGATAA TATAGGAGCA TTATTAAGAG GAGTACAAAG 550

AAACGAGATA GAAAGAGGAC AAGTACTAGC AAAGACTGGA TCAGTAAAGG 600 

CACACACAAA GTTTACAGCA GAAGTATATG TACTTAAAAA AGAAGARGGT 650 

GGAAGACATA CACCATTCTT TGATGGATAT AGACCACAAT TCTATTTCAG 700 

AACAACAGAC GTAACAGGAG CTTGTAAGTT ACCAGAAGGA ATAGAGATGG 750 

TAATGCCTGG AGATAACGTA ACAATGGAAG TAGACTTAAT AAACTCAA 798



2) INFORMATION FOR SEQ ID NO: 737 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia pseudomallei 

(B) STRAIN: 1026B 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 737 

GTCAACATGA TGGAGCTCAT CAACAACATC GCGAAGGAGC ACGGCGGTTA 50 

CTCCGTGTTC GCGGGCGTGG GCGAGCGTAC CCGTGAAGGG AACGACTTCT 100 

ACCACGAAAT GAAGGACTCG AACGTTCTCG ACAAGGTCGC GCTGGTGTAC 150 

GGCCAGATGA ACGAGCCGCC GGGCAACCGT CTGCGCGTGG CGCTGACGGG 200 

CCTCACGATG GCCGAGCACT TCCGTGACGA AGGCCTCGAC GTGCTGTTCT 250 

TCGTCGACAA CATCTACCGT TTCACGCTGG CCGGTACCGA AGTGTCGGCG 300

CTGCTCGGCC GTATGCCGTC GGCAGTGGGC TATCAGCCGA CGCTGGCTGA 350 

AGAAATGGGC AAGCTGCAAG AGCGCATCAC GTCGACGAAG AAGGGCTCGA 400 

TCACGTCGGT T 411 



2) INFORMATION FOR SEQ ID NO: 738 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium bifermentans 

(B) STRAIN: ATCC 638 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 738

TACAAGAGCT TATTAACAAT ATAGCTACTC AACACGGTGG TATATCAGTA 50

TTCGCAGGTG TTGGAGAGAG AACAAGAGAA GGTAACGACT TATTCCATGA 100

GATGAGCGAT ACAGGAGTTA TAAATAAAAC AGCTCTAGTA TTCGGACAAA 150

TGAATGAGCC ACCTGGAGCA AGAATGAGAG TTGCTTTAAC TGGTCTTACA 200

ATGGCTGAAT ACTTCAGAGA TCAACAAGGG CAAGACGTTT TATTATTCGT 250

AGATAATATA TTCCGTTTCA CTCAAGCAGG ATCTGAGGTT TCTGCACTTC 300

TTGGACGTAC TCCATCAGCA GTTGGATACC AACCAACATT AGCAACAGAG 350

ATGGGTAGAT TACAAGAGAG AATAACATCT ACAAATAAAG GGTC 394



2) INFORMATION FOR SEQ ID NO: 739 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Clostridium beijerincki (deposited as
Clostridium butyricum)

(B) STRAIN: ATCC 8260 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 739 

TTAATAAACA ACATAGCTAA ACAACATGGT GGTTTATCAG TATTTACTGG 50

AGTTGGTGAA AGATCAAGAG AAGGTAATGA CTTATATCAT GAAATGAGAG 100

AGTCAGGAGT TATTGATAAG ACAGCATTAG TATTTGGACA AATGAATGAG 150

CCACCGGGTG CCAGAATGAG AGTTGCATTA ACAGGTCTTA CTATGGCAGA 200

GTATTTTAGA GATAAAGGTC AAGATGTGTT ACTATTCATA GATAACATAT 250

TCAGATATAC TCAAGCAGGT TCAGAGGTTT CAGCATTACT TGGAAGAACA 300

CCTTCAGCGG TTGGATATCA GCCAACACTT GCAACTGAAA TGGGTGCACT 350

TCAGGAAAGA ATTACATCAA CAGTTAATGG TTCTATTACG TCAG 394






2) INFORMATION FOR SEQ ID NO: 740 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740 

TTATAAACAA TATTGCTAAG CAACATGGTG GTATTTCTGT ATTTTCAGGA 50

GTAGGAGAAA GAACAAGAGA AGGTAACGAC CTTTATGGCG AAATGAGTGA 100

GTCTGGAGTT ATAAATAAAA CAGCTCTAGT ATTTGGTCAA ATGAATGAAC 150

CACCTGGAGC GAGAATGAGA GTTGCTTTAA CTGGACTTAC AATGGCAGAA 200

CATTTTAGAG ATGAGCAAGG ACAAGACGTT TTACTTTTCG TTGATAATAT 250

ATTCCGTTTC ACACAAGCTG GTTCAGAAGT TTCAGCACTT CTAGGACGTA 300

TGCCATCAGC TGTTGGTTAT CAGCCAACAT TAGCTACTGA AATGGGTGCA 350

CTTCAAGAGA GAATAACATC AACTAAGAAA GGTTCAATAA CAT 393



2) INFORMATION FOR SEQ ID NO: 741



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 398 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium ramosum 

(B) STRAIN: ATCC 25582 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741 

WO 01/23604 PCT/CA00/01150

TTGATTCAAG AATTCATTAA TAACATTGCT ACAGAACATG GTGGTTTATC 50 

AGTTTTTGCT GGAGTTGGTG AACGTAGCCG TGAAGGTAAT GATTTATATT 100 

ATGAAATGAA GGAAAGTGGT GTTTTATCTA AAACAACACT AGTATTTGGA 150 

CAGATGAATG AACCCCCAGG AGCTCGTTTA AGAGTTGCTT TAACGGGTCT 200 

TACTATGGCA GAAGAATTCC GTGATGAACA AGGTCAGGAT GTCTTATTAT 250 

TCATCGATAA TATTTTCCGT TTTACTCAAG CTGGATCTGA AGTATCTGCC 300 

TTACTTGGAC GGGTACCATC ACAAGCTGGG TATCAGCCAA CTTTAGCAAC 350 

CGAAATGGGT GCTTTACAAG AACGGATTAC ATCAACTAAA AAAGGATC 398 



2) INFORMATION FOR SEQ ID NO: 742 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12964 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 742 



TAGCTAAGGA ACACGGTGGA CTTTCAGTAT TCACAGGTGT TGGAGAAAGA 50 

TCAAGAGAAG GTAATGATTT ATATTACGAA ATGAAAGAAT CAGGAGTTAT 100 

AGACAAGACA GCTCTAGTGT TTGGACAAAT GAATGAATCT CCAGGAGCTA 150

GAATGAGAGT ATCTTTAACA GGATTAACTA TGGCTGAATA TTTCAGAGAT 200 

CAAGGTCAAG ATGTGCTTTT ATTCATAGAT AACATATTTA GATTf ACTCA 250 

AGCTGGATCA GAAGTATCGG CTTTACTTGG AAGAATACCA TCAGCAGTTG 300 

GTTATCAACC AACACTAGCA ACTGAAATGG GTGCACTTCA AGAAAGAATT 350 

ACTTCAACTA AAAATGGATC AATAACTTCA 380



2) INFORMATION FOR SEQ ID NO: 743 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tertium

(B) STRAIN: ATCC 14573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 743 

TTAATAAATA ATATAGCAAA AGAGCATGGT GGTCTTTCTG TATTTACAGG 50

AGTTGGAGAA AGGTCAAGAG AAGGTAACGA CTTATATTAT GAAATGAAAG 100 

AGTCAGGGGT TATAGATAAG ACAGCTTTAG TATTTGGACA AATGAATGAA 150 

TCACCAGGAG CAAGAATGAG AGTTTCATTA ACTGGATTAA CTATGGCTGA 200 

ATATTTTAGA GATCAAGGTC AAGACGTTCT TTTATTTATA GATAATATAT 250 

TTAGATTTAC TCAAGCGGGA TCAGAAGTTT CTGCGTTATT AGGAAGAATT 300

CCTTCAGCAG TTGGATATCA ACCAACTCTT GCAACTGAAA TGGGAGCACT 350

TCAAGAAAGA ATAACATCAA CAAAGAATGG ATCAATCAC 389 



2) INFORMATION FOR SEQ ID NO: 744 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Comamonas acidovorans 

(B) STRAIN: ATCC 15668 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 744

TTCCCCCGCA CGCATGCCCA AGGTGTTCGA TGCCCTGAAG CTCGACGGCT 50 

CGGCCCTGAC GCTGGAAGTG CAGCAACTGC TGGGTGACGG CGTTGTGCGT 100 

ACCATCGCCC TGGGTTCGTC CGACGGTCTG CGTCGCGGCC TGATGGTGTC 150 

CAACACCGGC AACCCCATCA CCGTGCCCGT GGGCAAGGCG ACGCTGGGTC 200

GCATCATGGA CGTGCTGGGC AATCCCATCG ACGAACGTGG TCCCGTGGAT 250 

CAGGCGCTGA CGGCTCCCAT CCACCGCAAG GCACCGGCTT ATGACGAGCT 300 

GTCGCCTTCG CAGGAACTGC TGGAAACCGG CATCAAGGTG ATCGACCTGA 350 

TCTCGCCCTT CGCCAAGGGC GGCAAGGTGG GTCTGTTCGG TGGCGCCGGT 400 

GTGGGCAAGA CCGTGAACAT GATGGAACTC ATCAACAACA TCGCCAAGGG 450

CCACGGTGGT CTGTCGGTGT TCGCCGGTGT GGGTGAACGT ACCCGCGAAG 500 

GCAATGACTT CTATCACGAA ATGTCGGACG CCGGCGTGGT CAACCAGGAG 550 

TCGCTGAACG ACTCCAAGGT GGCCATGGTC TACGGCCAGA TGAACGAACC 600 

CCCGGGCAAC CGTCTGCGCG TGGCGCTGAC CGGCCTGACC ATGGCCGAAG 650 

CCTTCCGTGA CGAAGGCAAG GACGTGCTGT TCTTCGTGGA CAACATCTAC 700

CGCTACACGC TGGCCGGTAC CGAAGTGTCC GCTCTGCTGG GTCGCATGCC 750 

TTCCGCCGTG GGCTACCAGC CCACGCTGGC CGAGGAAATG GGCCGCCTGC 800

AAGAGCGCAT CACCTCGACC AAGGTCGGTT CGATCACTTC CAC 843 






2) INFORMATION FOR SEQ ID NO: 745 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscleromatis 

(B) STRAIN: ATCC 13884

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 745 

GCCGTACCAC GCGTGTACGA AGCCCTTGAG GTACAGAATG GTAATGAAGT 50 

TCTGGTGCTG GAAGTTCAGC AGCAGCTGGG CGGCGGTATC GTACGTACCA 100

TCGCCATGGG TTCTTCTGAT GGTCTGCGCC GCGGTCTGGA TGTXAAAGAC 150

CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 200

CATGAACGTA CTGGGTCAAC CGGTTGACAT GAAAGGCGAC ATCGGCGAAG 250

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CGTCCTATGA AGAGCTGTCC 300

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350

TCCGTTCGCC AAGGGCGGTA AAGTTGGTCT GTTCGGCGGT GCGGGTGTAG 400

GTAAAACTGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAGCGTACTC GTGAGGGTAA 500

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550

TGGTGTACGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACTGAAG 700

TATCCGCGCT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCGACC 750

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800

CGGTTCTATC ACCTCCGTA 819






2) INFORMATION FOR SEQ ID NO: 746 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria canis

(B) STRAIN: ATCC 14687 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 746 



GCGATTCTAT TCCGCGCGTA TATGATGCTC TTAAACTAGT GGATAGAGAA 50

CTGACGCTTG AAGTACAACA ACAGTTGGGT GATGGTGTCG TTCGTACTAT 100

TGCGATGGGT AGTTCCGACG GCCTCAAACG AGGTTTGGCG GTAGTTAACA 150

CCGGTGCTCC AATTACAGTG CCTGTGGGCA AAGCAACATT AGGCCGTATT 200

ATGGACGTAT TAGGTAATCC GGTTGATGAA GCTGGTCCGA TTGGCTCCGA 250

GCAAACCCGA GCAATCCACC AACCTGCTCC TAAGTTCGAC GAGCTTTCTA 300

GCGCCACAGA GATTTTGGAA ACAGGTATTA AAGTAATTGA TTTGCTTTGC 350

CCGTTTGCCA AAGGCGGTAA AGTAGGTTTG TTTGGTGGTG CGGGAGTGGG 400

CAAAACCGTA AATATGATGG AGTTGATTAA CAACATCGCG AAAGCACACA 450

GCGGTTTGTC TGTATTTGCC GGTGTGGGTG AACGGACGCG TGAAGGTAAT 500

GACTTTTATC ATGAGATGAA AGATTCCAAT GTATTAGATA AAGTAGCCAT 550

GGTTTACGGT CAGATGAACG AGCCTCCCGG TAACCGTTTG CGCGTTGCGC 600

TAACTGGCTT GTCTATGGCC GAATTCTTCC GTGACGAGAA AGATGAAAAC 650

GGTAAAGGCC GTGATGTATT GTTCTTTGTA GACAATATTT ACCGCTATAC 700

CCTAGCCGGT ACAGAAGTAT CTGCATTGCT TGGCCGTATG CCTTCGGCAG 750

TAGGTTATCA GCCGACGTTG GCAGAGGAAA TGGGCCGCTT GCAAGAGCGT 800

ATTACYTCAM CCCAAACAGG CTCT 824



2) INFORMATION FOR SEQ ID NO: 747

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cinerea 

(B) STRAIN: ATCC 14685 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 747 



CGCGACGCTA TCCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50

TCTGACTCTG GAGGTTCAAC AGCTTCTGGG CGACGGCGTT GTCCGTACTA 100

TTGCAATGGG TAGTTCAGAC GGCCTTAAAC GCGGTATGTC TGTAAGCAAT 150

ACTGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGTCGTAT 200

TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 250

ACAAAAGCCG TGCCATTCAC CAAACTGCTC CGAAATTCGA CGAGTTGTCT 300

TCAGCTACCG AATTGTTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 350

TCCGTTTGCT AAAGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 400

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 450

AGCGGTCTGT CCGTGTTCGC AGGTGTGGGT GAGCGTACCC GTGAAGGTAA 500

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550

TGGTGTATGG CCAAATGAAC GAACCTCCGG GCAACCGTCT GCGCGTTGCT 600

TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGACGAAAA 650

CGGTAAAGGC CGCGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 700

CTTTGGCCGG TACTGAAGTA TCTGCACTGT TGGGCCGTAT GCCTTCTGCA 750

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAAGAGCG 800

TATTACCTCT ACCCAAACCG GTTCCATTAC T 831






2) INFORMATION FOR SEQ ID NO: 748 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 862 bases 

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cuniculi 

(B) STRAIN: ATCC 14688 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 748 



CCGTGGCCAA GTACCACAAA TTTATGACGC ACTGAGTGTT GATGGCACCG 50

AAACAACCTT GGAAGTTCAA CAGCAGTTGG GTGATGGCGT GGTGCGTACC 100

ATTGCGATGG GTTCAACCGA AGGCTTGAAG CGTGGTTTGA CTGTATCTAA 150

CTCTGGTGCA CCGATTTCTG TGCCAGTGGG TCAAGCGACT TTGGGTCGTA 200

TTATGGATGT GTTGGGTCGT CCAATCGACG AGGCAGGTCC TGTAAATGCT 250

CAAGAAAAAT GGTCAATTCA CCGTGAAGCA CCAAGCTATG ATGAGCAATC 300

AAACTCAACT GAGCTGCTAG AAACAGGCAT CAAAGTAATT GATTTGCTTT 350

GCCCATTTGC TAAAGGTGGT AAAGTTGGCT TGTTCGGTGG TGCAGGTGTG 400

GGTAAAACCG TGAATATGAT GGAGCTGATT AATAATATCG CTCTGAAGCA 450

TTCAGGTCTT TCTGTTTTTG CAGGTGTTGG TGAGCGTACT CGTGAGGGTA 500

ACGATTTTTA TCACGAAATG CAAGAAGCAG GCGTTGTTAA TACCGAAGAC 550

TTCACCAAGT CAAAAGTAGC GATGGTTTAT GGTCAGATGA ATGAGCCACC 600

AGGAAACCGT TTGCGTGTTG CATTGACAGG CTTGACGATG GCAGAATATT 650

TCCGTGACGA AAAAGATGAA GCAACAGGCA AAGGGCGTGA TGTTCTATTG 700

TTCGTTGATA ACATCTATCG TTACACACTG GCTGGTACGG AAGTGTCAGC 750

ACTTCTAGGT CGTATGCCAT CAGCAGTAGG TTATCAACCG ACTCTGGCTG 800

AAGAAATGGG TGCGTTGCAA GAGCGTATTA CCTCAACGCA ATCGGGTTCG 850

ATCACTTCGG GG 862



2) INFORMATION FOR SEQ ID NO: 749

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria elongata subsp. elongata 

(B) STRAIN: ATCC 25295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 749 



GGAACTCCCA CGTGACGCTA TCCCGCATGT TTTTGATGCA TTAAAATTAG 50

TTGAAAATGA CCTAACCTTA GAAGTTCAAC AACTTTTGGG GGATGGTGTA 100

GTGCGTACCA TTGCGATGGG TAGTTCAGAT GGATTAAAGC GTGGTATGGC 150

TGTGAATAAT ACCGGAGCTC CGATTACTGT TCCTGTTGGC CGTGAAACTT 200

TGGGTCGTAT CATGGATGTA TTGGGTAATC CGGTTGATGA GGCAGGTCCG 250

GTAAATGCAT CCAATACACG TGCGATCCAT CAAGAGGCTC CTAAGTTTGA 300

TGAGCTTTCT TCAACAACGG AATTATTAGA AACTGGCATT AAGGTTATCG 350

ACTTGTTATG TCCGTTTGCC AAAGGTGGTA AAGTAGGTCT GTTTGGTGGT 400

GCGGGTGTAG GTAAAACCGT AAATATGATG GAGTTAATTA ACAACATTGC 450

CAAGGCACAT AGCGGTTTGT CTGTGTTTGC AGGCGTGGGT GAACGTACTC 500

GTGAAGGTAA TGACTTCTAT CACGAGATGA AAGATTCCAA CGTATTGGAC 550

AAAGTGGCAA TGGTTTACGG TCAGATGAAC GAACCTCCAG GCAACCGTCT 600

GCGCGTTGCT TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA 650

AAGACGAAAA CGGTAAAGGT CGCGACGTAT TGTTCTTCGT GGACAACATT 700

TACCGTTACA CTTTGGCCGG TACGGAAGTA TCCGCATTGC TGGGTCGTAT 750

GCCTTCAGCA GTAGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT 800

TGCAAGAGCG TATTACCTCT ACCCAGACAG GCTCTATTAC TTCC 844






2) INFORMATION FOR SEQ ID NO: 750 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria flavescens 

(B) STRAIN: ATCC 13120 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 750 

CGCGACGCTA TTCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50

TCTGACTCTG GAAGTTCAAC AGCTTCTGGG TGACGGCGTT GTCCGT&TTA 100

TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GCGGCATGTC TGTAAGCAAT 150

ACCGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGCCGTAT 200 

TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 250 

ACAAGAGCCG TGCCATTCAC CAAACTGCTC CGAAATTCGA TGAGTTGTCT 300

TCAACTACCG AATTGTTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 350 

TCCGTTTGCT AAGGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 400 

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 450 

AGCGGCCTGT CCGTGTTCGC AGGTGTGGGT GAACGTACCC GTGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550

TGGTGTATGG TCAAATGAAC GAACCTCCAG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGATGAAAA 650 

CGGTAAAGGT CGCGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 700 

CTCTGGCCGG TACCGAAGTA TCCGCACTGT TGGGTCGTAT GCCTTCTGCA 750 

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAGGAGCG 800

TATTACTTCT ACGCAAACCG GTTCCATTAC TTCC 834 



2) INFORMATION FOR SEQ ID NO: 751

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 834 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Neisseria gonorrhoeae

(B) STRAIN: ATCC 31426

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 751



CGCGACATGA TTCCGCGCGT TTACGACGCT TTGAAATTAG ACGAAAACGG 50 

TCTGACTTTG GAAGTCCAAC AGCTTTTGGG TGATGGCGTA GTCCGTACCA 100 

TCGCTATGGG CAGCTCGGAC GGTTTGAAAC GCGGCATGAC TGTGAGCAAT 150 

ACTGGTTCGC CCATTACTGT GCCGGTAGGT AAAGGTACGT TGGGACGCAT 200 

TGTCGATGTA TTGGGAACTC CTGTTGACGA GGCAGGTCCA ATTGATACCG 250

ACAAGAGTCG TGCCATCCAC CAAGCCGCTC CTAAGTTTGA CGAACTGTCT 300 

TCCACAACCG AATTGCTCGA AACGGGCATT AAAGTGATTG ACTTGCTGTG 350 

TCCGTTTGCC AAAGGCGGTA AAGTAGGTCT GTTCGGCGGT GCCGGTGTGG 400

GTAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 450 

AGCGGCTTGT CCGTGTTCTC AGGCGTGGGT GAGCGTACCC GCGAAGGTAA 500

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550 

TGGTGTATGG CCAAATGAAC GAACCTCCAG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC TGAATACTTC CGTGACGAAA AAGACGAAAA 650 

CGGTAAAGGT CGTGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 700 

CTCTGGCCGG TACCGAAGTA TCCGCACTGT TGGGCCGTAT GCCTTCTGCA 750

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAAGAGCG 800 

TATTACCTCT ACCCAAACCG GTTCCATTAC TTCC 834 



2) INFORMATION FOR SEQ ID NO: 752

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Neisseria gonorrhoeae

(B) STRAIN: ATCC 27628

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 752

ACATGATTCC GCGCGTTTAC GACGCTTTGA AATTAGACGA AAACGGTCTG 50 

ACTTTGGAAG TCCAACAGCT TTTGGGTGAT GGCGTAGTCC GTACCATCGC 100 

TATGGGCAGC TCGGACGGTT TGAAACGCGG CATGACTGTG AGCAATACTG 150 

GTTCGCCCAT TACTGTGCCG GTAGGTAAAG GTACGTTGGG ACGCATTGTC 200

GATGTATTGG GAACGCCTGT TGACGAGGCA GGTCCAATTG ATACCGACAA 250 

GAGTCGTGCC ATCCACCAAG CCGCTCCTAA GTTTGACGAA CTGTCTTCCA 300 

CAACCGAATT GCTCGAAACG GGCATTAAAG TGATTGACTT GCTGTGTCCG 350 

TTTGCCAAAG GCGGTAAAGT AGGTCTGTTC GGCGGTGCCG GTGTGGGTAA 400

AACCGTGAAC ATGATGGAAT TGATCAACAA CATCGCCAAA GCGCACAGCG 450

GCTTGTCCGT GTTCTCAGGC GTAGGTGAGC GTACCCGCGA AGGTAACGAC 500 

TTCTACCACG AGATGAAAGA TTCCAACGTA TTGGATAAAG TAGCCATGGT 550 

GTATGGCCAA ATGAACGAAC CTCCAGGCAA CCGTCTGCGC GTTGCTTTGA 600 

CCGGTTTGAC TATGGCTGAA TACTTCCGTG ACGAAAAAGA CGAAAACGGT 650 

AAAGGTCGTG ACGTATTGTT CTTCGTTGAC AACATCTACC GTTACACTCT 700

GGCCGGTACC GAAGTATCCG CACTGTTGGG CCGTATGCCT TCTGCAGTGG 750 

GTTACCAACC GACATTGGCT GAAGAAATGG GTCGTTTGCA AGAGCGTATT 800 

ACCTCTACCC AAACCGGTTC CATTA 825 






2) INFORMATION FOR SEQ ID NO: 753 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 831 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria lactamica 

(B) STRAIN: ATCC 23970 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 753 



GATGCGATTC CGCATGTTTA CGATGCCCTG AAATTGGACG AGAACGGTCT 50 

GACCCTGGAA GTCCAACAGC TTTTGGGTGA CGGCGTTGTC CGTACTATTG 100 

CAATGGGTAG TTCAGACGGC CTGAAACGCG GCATGTCTGT CAGCAATACC 150

GGTGCGCCAA TCACTGTGCC GGTAGGTAAA GGTACGTTGG GCCGTATTGT 200

CGACGTATTG GGTACACCTG TTGACGAAGC AGGTCCGATC GATACCGACA 250 

AGAGCCGCGC CATCCACCAA ACCGCCCCGA AATTCGACGA GTTGTCTTCA 300 

ACTACCGAAT TGTTGGAAAC CGGCATTAAA GTGATCGATT TGCTGTGTCC 350 

GTTTGCTAAG GGCGGTAAAG TAGGTCTGTT CGGTGGTGCC GGTGTGGGCA 400

AAACCGTGAA CATGATGGAA TTGATCAACA ACATCGCCAA AGCGCACAGC 450

GGTCTGTCCG TGTTTGCAGG CGTGGGCGAG CGTACCCGCG AAGGTAACGA 500 

CTTCTACCAC GAGATGAAAG ATTCCAACGT ATTGGATAAA GTAGCCATGG 550 

TGTATGGTCA GATGAACGAA CCTCCGGGCA ACCGTCTGCG CGTTGCTTTG 600 

ACCGGTTTGA CGATGGCCGA ATACTTCCGC GACGAAAAAG ACGAAAACGG 650

CAAAGGCCGC GACGTATTGT TCTTCGTGGA CAACATCTAC CGTTACACCC 700

TGGCCGGTAC CGAAGTATCC GCACTGTTGG GCCGTATGCC TTCCGCAGTG 750

GGTTACCAAC CGACATTGGC TGAAGAAATG GGTCGTTTGC AAGAGCGTAT 800

TACCTCTACC CAAACCGGTT CCATTACTTC C 831
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2) INFORMATION FOR SEQ ID NO: 754 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 836 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: 2241C 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 754 

CCACGCGACA TGATTCCGCG CGTTTACGAC GCTTTGAAAT TAGACGAAAA 50 

CGGTCTGACT TTGGAAGTCC AACAGCTTTT GGGCGACGGC GTAGTCCGTA 100 

CCATTGCGAT GGGCAGCTCG GACGGTTTGA AACGCGGCAT GACTGTGAGC 150 

AATACCGGTG CGCCCATTAC TGTGCCGGTA GGTAAAGGTA CGTTGGGACG 200 

CATTGTCGAT GTATTGGGAA CGCCTGTTGA CGAGGCAGGT CCAATCGATA 250 

CCGACAAGAG CCGTGCCATC CACCAAGCCG CTCCTAAGTT TGACGAACTG 300 

TCTTCCACAA CCGAATTGCT CGAAACGGGC ATTAAAGTGA TTGACTTGCT 350 

GTGTCCGTTT GCCAAAGGCG GTAAAGTAGG TCTGTTCGGC GGTGCCGGTG 400

TGGGTAAAAC CGTGAACATG ATGGAATTGA TCAACAACAT CGCCAAAGCG 450 

CACAGCGGCT TGTCCGTGTT CGCAGGCGTG GGTGAGCGTA CCCGCGAAGG 500 

TAACGACTTC TACCACGAGA TGAAAGATTC CAACGTATTG GATAAAGTGG 550 

CAATGGTTTA CGGTCAGATG AACGAACCTC CGGGCAACCG TTTGCGCGTC 600 

GCATTGACCG GTTTGACCAT GGCGGAATAC TTCCGTGACG AAAAAGACGA 650 

AAACGGCAAA GGCCGCGACG TATTGTTCTT CGTGGACAAC ATCTACCGTT 700 

ACACTCTAGC TGGTACCGAA GTATCCGCAT TGTTGGGCCG TATGCCGTCT 750 

GCAGTGGGCT ACCAACCGAC ATTGGCAGAA GAAATGGGTC GTTTGCAGGA 800 

GCGTATTACC TCTACCCAAA CCGGTTCTAT TACTTC 836 



2) INFORMATION FOR SEQ ID NO: 755 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria mucosa 

(B) STRAIN: ATCC 19696 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 755 
CCGCGTGATG CCATTCCGCG TGTTTACGAC GCCCTGAAAT TGGATGCAAA 50

CGGCCTGACT TTGGAAGTGC AACAGCTTCT GGGTGACGGC GTAGTTCGTA 100

CTATTGCAAT GGGTAGTTCG GATGGTTTGA AACGCGGCAT GACTGTAAGC 150 

AATACAGGTG CGCCGATTAC AGTACCGGTA GGTAAAGGTA CTTTGGGACG 200 

TATTGTCGAT GTATTGGGTA CGCCTGTTGA CGAAGCAGGT CCGATTGATA 250 

CCGACAAACA CCGTGCCATC CATCAGACAG CTCCGAAATT TGATGAACTG 300

TCTGCGACTA CTGAGCTGCT GGAAACCGGC ATTAAAGTGA TTGACTTGCT 350 

GTGTCCGTTT GCCAAAGGCG GTAAAGTAGG TCTGTTCGGT GGTGCCGGTG 400 

TAGGCAAAAC CGTCAACATG ATGGAATTAA TTAACAACAT CGCCAAAGCA 450 

CATAGCGGTT TGTCCGTGTT TGCAGGTGTG GGTGAGCGTA CCCGTGAAGG 500 

TAATGACTTC TACCACGAGA TGAAAGATTC CAACGTATTG GACAAAGTGG 550

CGATGGTTTA CGGTCAGATG AACGAACCTC CGGGTAACCG TCTGCGTGTA 600 

GCCTTGACCG GTTTGACGAT GGCCGAATAC TTCCGTGATG AAAAAGACGA 650 

AAGCGGCAAA GGCCGTGACG TATTGTTCTT CGTGGACAAT ATCTACCGTT 700 

ACACCCTGGC CGGTACCGAA GTATCCGCAT TGTTGGGTCG TATGCCTTCA 750 

GCAGTAGGTT ACCAACCGAC ATTGGCTGAA GAAATGGGTC GTTTGCAAGA 800

GCGTATTACC TCTACCCAAA CAGGCTCCAT TACCTCC 837 



2) INFORMATION FOR SEQ ID NO: 756

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 834 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Neisseria subflava

(B) STRAIN: ATCC 14221
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 756 



CGCGACGCTA TCCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50 

TCTGACTCTG GAAGTTCAAC AGCTTCTGGG TGACGGCGTT GTCCGTACTA 100 

TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GCGGCATGTC TGTAAGCAAT 150 

ACTGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGTCGTAT 200 

TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 250

ACAAGAGCCG TGCCATTCAC CAAACTGCTC CGAAATTCGA CGAGTTGTCT 300 

TCAACTACCG AATTGCTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 350 

TCCGTTTGCT AAGGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 400

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 450

AGCGGTCTGT CCGTGTTCGC AGGTGTGGGT GAACGTACCC GTGAAGGTAA 500

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550 

TGGTGTATGG CCAAATGAAC GAACCTCCGG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGACGAAAA 650 

CGGTAAAGGT CGCGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 700 

CTCTGGCCGG TACCGAAGTA TCTGCACTGT TGGGCCGTAT GCCTTCTGCA 750

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAAGAGCG 800 

TATTACCTCT ACCCAAACTG GTTCCATTAC TTCC 834 



2) INFORMATION FOR SEQ ID NO: 757 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 833 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria weaveri 

(B) STRAIN: ATCC 51223 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 757 

CCCGTGATGC TATTCCCATG TATACGATGC CCTGAAATTG GTAGATAACG 50 

ATCTGACCCT GGAAGTGCAA CAACTTTTAG GTGATGGTGT GGTTCGTACC 100 

ATTGCAATGG GTAGTTCAGA CGGCCTAAAA CGTGGTATGG CTGTTAACAA 150 

TACCGGCGCT CCGATTACTG TTCCGGTGGG GAAAGCCACC TTGGGACGTA 200 

TTATGGATGT GTTGGGTAAT CCGGTTGATG AAGCAGGTCC TGTTGTATCA 250 

GAAGAAACTC GCGCTATTCA TCAAGCTGCC CCTAAATTTG ACGAACTGTC 300 

TTCAGCAACT GAGTTGTTGG AAACAGGCAT TAAAGTAATT GACTTGCTGT 350 

GCCCGTTTGC CAAAGGTGGT AAAGTAGGTT TGTTTGGTGG TGCCGGCGTG 400

GGTAAAACCG TAAATATGAT GGAGTTGATC AACAACATCG CGAAGGCACA 450

TAGTGGTTTG TCTGTATTCG CCGGTGTAGG TGAGCGTACC CGTGAAGGTA 500 

ACGACTTCTA CCATGAAATG AAAGACTCTA ACGTATTGGA TAAAGTAGCC 550 

ATGGTTTATG GCCAGATGAA TGAACCTCCG GGTAACCGTT TGCGCGTTGC 600 

TTTGACTGGT TTGACTATGG CCGAATATTT CCGTGACGAG AAAGATGAAA 650 

ACGGCAAAGG TCGTGACGTC TTGTTCTTTG TGGATAATAT CTATCGCTAT 700 

ACTCTGGCCG GTACTGAAGT GTCTGCACTG TTAGGTCGTA TGCCGTCTGC 750 

AGTAGGTTAT CAGCCTACAT TGGCAGAAGA AATGGGTCGC TTGCAGGAGC 800 

GTATTACTTC TACTCAAACA GGTTCGATTA CTT 833 



2) INFORMATION FOR SEQ ID NO: 758 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria animalis

(B) STRAIN: ATCC 19573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 758 



CGCGATGCCA TTCCTCACGT TTACGATGCC CTGAAATTGG ACGACACCGG 50

TCTGACTTTG GAAGTACAAC AACTTCTGGG CGACGGTGTG GTACGTACCA 100

TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GGGGTTTGTC TGTGAGCAAT 150

ACCGGTTCTC CGATTGCCGT TCCTGTCGGT AAAGCGACTT TGGGTCGTAT 200

TATGGACGTA TTGGGCAATC CGGTTGATGA AGCCGGTCCG GTTGCTACCG 250

AAGAGAAACG TGCTATTCAC CAAGCCGCAC CGAAGTTTGA TGAATTGTCA 300

TCAGCTACCG AGTTGTTGGA AACCGGTATT AAAGTAATCG ACTTGCTGTG 350

TCCGTTTGCA AAAGGCGGTA AAGTAGGTCT GTTCGGCGGT GCCGGTGTGG 400

GCAAAACCGT AAACATGATG GAATTGATCA ACAACATCGC CAAAGCACAC 450

AGCGGTCTGT CTGTGTTTGC CGGTGTAGGT GAACGTACCC GCGAAGGTAA 500

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTGTTGGAT AAAGTAGCCA 550

TGGTGTACGG TCAGATGAAT GAGCCGCCGG GCAACCGCTT GCGCGTGGCT 600

TTGACCGGCC TGACTATGGC CGAATACTTC GTGACGAAAA AGACGAAAAC 650

GGCAAAGGTC GTGACGTATT GTTCTTCGTG GACAACATTT ACCGCTACAC 700

ACTGGCCGGT ACCGAAGTAT CAGCATTGTT GGGCCGTATG CCGTCTGCAG 750

TAGGTTATCA GCCGACATTG GCAGAGGAAA TGGGTCGCTT GCAAGAGCGT 800

ATTACCTCTA CCCAAACCGG TTCGATTACC TCT 833






2) INFORMATION FOR SEQ ID NO: 759 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 






(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus penneri 

(B) STRAIN: ATCC 33519 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 759 






AGCGTCCCTA AAGTATACGA CGCTCTTGAG GTTATGAATG GTAAAGAAAA 50

ACTGGTGCTG GAAGTTCAGC AACAGTTAGG CGGTGGTATC GTTCGTTGTA 100

TCGCAATGGG TACATCAGAC GGTTTAAGCC GTGGCTTAAA AGTTGAAGAT 150

TTAGGCCACC CAATTGAAGT ACCAGTAGGT AAAGCAACAC TGGGACGTAT 200

CATGAACGTT CTGGGTACAC CTATTGATAT GAAAGGTGAT ATTGCAACTG 250

AAGAACGTTG GTCTATTCAC CGTGAAGCAC CAACCTACGA AGAGTTATCA 300

AACTCACAAG AACTGCTTGA AACCGGTATC AAAGTAATGG ACTTAATCTG 350

TCCGTTTGCT AAAGGTGGTA AAGTAGGTCT GTTCGGTGGT GCGGGTGTGG 400

GTAAAACAGT TAACATGATG GAATTGATCC GTAATATCGC GATCGAGCAC 450

TCAGGTTACT CTGTATTTGC TGGTGTTGGT GAGCGTACTC GTGAGGGTAA 500

CGACTTCTAT CATGAAATGA CAGATTCTAA CGTTCTTGAC AAAGTATCGT 550

TAGTTTATGG TCAGATGAAT GAGCCACCAG GAAACCGTCT GCGTGTAGCA 600

CTGACTGGTC TGACTATGGC TGAAAAATTC CGTGATGAAG GCCGTGACGT 650

ACTGTTATTC GTCGATAACA TTTATCGTTA CACCTTAGCC GGTACAGAAG 700

TATCAGCACT GTTAGGTCGT ATGCCATCAG CGGTAGGTTA CCAGCCAACA 750

TTGGCTGAAG AGATGGGTGT TCTGCAAGAA CGTATCACTT CAACCAAAAC 800

AGGTTCAATC ACCTCTGTA 819






2) INFORMATION FOR SEQ ID NO: 760 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis serotype Enteritidis

(B) STRAIN: ATCC 13076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 760 

GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 200 

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGTTGTCA 300 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CAGTGTTTGC GGGCGTAGGG GAACGTACTC GTGAGGGTAA 500

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCCG CAGTAGGTTA CCAGCCGACT 750

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGGTTCTATC ACCTCCGTA 819 






2) INFORMATION FOR SEQ ID NO: 761 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 812 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pestis 

(B) STRAIN: KIM D27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 761

ACCAAAAGTG TACAACGCCC TTGAGGTAGA AGGTACAACT GAAAAGTTAG 50 

TGCTGGAAGT TCAGCAACAG TTGGGCGGTG GTGTTGTTCG TTGTATCGCA 100 

ATGGGCTCTT CCGATGGTCT GAGCCGTGGG TTGAAAGTAA CCAACCTAGA 150 

ACACCCGATC GAAGTACCGG TTGGTAAAGC GACCCTTGGC CGTATCATGA 200

ACGTATTGGG TGAACCAATC GACATGAAAG GTCCTATCGG TGAAGAAGAG 250 

CGTTGGGCAA TCCATCGCGA AGCGCCTTCT TATGAAGAGC TTGCTAGCTC 300 

ACAAGATCTG TTAGAAACCG GTATCAAGGT TATGGACCTG ATTTGTCCGT 350 

TTGCTAAGGG CGGTAAAGTC GGTCTGTTCG GTGGTGCGGG TGTAGGTAAA 400 

ACAGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG AGCACTCTGG 450

GTATTCTGTA TTTGCCGGTG TAGGTGAGCG TACCCGTGAG GGTAATGACT 500 

TCTACCATGA AATGACTGAC TCCAACGTTT TGGACAAAGT ATCCTTGGTT 550 

TACGGCCAGA TGAATGAGCC ACCAGGTAAC CGTCTACGCG TTGCACTGAC 600 

CGGCCTGACC ATGGCGGAGA AATTCCGTGA TGAAGGTCGT GACGTACTGC 650 

TGTTTATCGA TAATATCTAT CGTTATACCC TAGCTGGTAC GGAAGTATCC 700

GCATTGCTGG GTCGTATGCC ATCAGCGGTA GGTTATCAGC CAACACTGGC 750 

TGAAGAGATG GGTGTGTTGC AGGAACGTAT TACTTCCACT AAGACGGGTT 800 

CAATCACCTC TG 812 



2) INFORMATION FOR SEQ ID NO: 762 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 408 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia mallei 

(B) STRAIN: GB8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 762 






CGACGTGATC GAGCCCTTCT TCGTCGACGT GATGCGCTCT TGCAGCTTGC 50 

CCATTTCTTC AGCCAGCGTC GGCTGATAGC CCACTGCCGA CGGCATACGG 100 

CCGAGCAGCG CCGACACTTC GGTACCGGCC AGCGTGAAAC GGTAGATGTT 150

GTCGACGAAG AACAGCACGT CGAGGCCTTC GTCACGGAAG TGCTCGGCCA 200 

TCGTGAGGCC CGTCAGCGCC ACGCGCAGAC GGTTGCCCGG CGGCTCGTTC 250 

ATCTGGCCGT ACACCAGCGC GACCTTGTCG AGAACGTTCG AGTCCTTCAT 300 

TTCGTGGTAG AAGTCGTTCC CTTCACGGGT ACGCTCGCCC ACGCCCGCGA 350 

ACACGGAGTA ACCGCCGTGC TCCTTCGCGA TGTTGTTGAT GAGCTCCATC 400

ATGTTGAC 408



2) INFORMATION FOR SEQ ID NO: 763

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Clostridium sordellii

(B) STRAIN: ATCC 9714

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 763



GAACTTATAA ACAACATAGC TACTCAACAT GGTGGTATAT CAGTATTCGC 50 

AGGTGTTGGA GAGAGAACAA GAGAAGGTAA CGACCTTTAC GGAGAAATGA 100 

GTGAGTCTGG AGTTATAAAC AAGACAGCTC TAGTATTCGG ACAAATGAAT 150 

GAGCCACCTG GAGCAAGAAT GAGAGTTGCT TTAACTGGTC TTACAATGGC 200 

TGAATATTTC AGAGATCAAG AAGGACAAGA CGTTTTATTA TTCGTAGATA 250

ATATATTCCG TTTCACTCAA GCAGGATCTG AGGTTTCTGC ACTTCTTGGA 300 

CGTACTCCAT CAGCAGTTGG ATACCAACCA ACATTAGCTA CAGAGATGGG 350 

TAGATTACAA GAGAGAATAA CATCTACAAA TAAAGGGTCT ATAACATCAG 400



2) INFORMATION FOR SEQ ID NO: 764

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 405 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Clostridium novyi

(B) STRAIN: ATCC 19402

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 764

TTAATTCAAG AATTAATCAA CAATATAGCG AAGGAACACG GTGGATTATC 50 

TGTATTTACA GGAGTTGGAG AAAGAACAAG AGAAGGTAAT GACCTTTACT 100 

ATGAAATGAA AGAATCTGGT GTTATAAATA AAACAGCACT AGTATTTGGT 150

CAAATGAATG AGCCACCTGG AGCAAGAATG AGAGTTGCTC TTACAGGACT 200 

TACTATGGCA GAATATTTCA GGGATCAAGG ACAAAACGTA CTTTTATTCA 250 

TAGACAACAT ATTTAGATTC ACTCAAGCAG GTTCAGAGGT GTCAGCTTTA 300 

CTTGGAAGAA TACCTAGTGC CGTTGGATAC CAACCAACAC TAGCAACAGA 350 

AATGGGTGCG CTTCAAGAAA GAATAACATC TACAAAGCAT GGTTCTATAA 400

CATCA 405 



2) INFORMATION FOR SEQ ID NO: 765

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 393 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Clostridium botulinum

(B) STRAIN: 20:3.1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 765



AATTAATAAA TAACATAGCT AAAGAACATG GTGGATTATC AGTGTTTACT 50 

GGAGTTGGAG AAAGATCTAG AGAAGGTAAT GATCTTTATC ATGAAATGAG 100 

AGAATCAGGC GTTATAGATA AAACCGCATT AGTATTTGGT CAAATGAATG 150 

AGCCACCTGG TGCTAGAATG AGAGTTGCAT TAACAGGGTT AACTATGGCT 200

GAATATTTTA GAGATAAAGG TCAAGATGTA TTACTATTTA TAGATAATAT 250

ATTTAGATTT ACTCAAGCTG GATCAGAAGT TTCAGCATTA CTTGGAAGAA 300

TACCATCAGC AGTTGGTTAC CAACCAACTT TAGCAACTGA AATGGGTGCA 350 

CTTCAAGAGA GAATTACATC AACTAAGAAT GGTTCTATAA CAT 393 



2) INFORMATION FOR SEQ ID NO: 766 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 399 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium histolyticum 

(B) STRAIN: ATCC 19401 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 766

AGGAACTTAT AAATAATATT GCAAAACAAT ATGGAGGTCT ATCTGTATTT 50

ACAGGTGTTG GTGAAAGAAC AAGAGAAGGT AATGACCTAT ATAATGAAAT 100

GAAAGATTCT GGGGTTATAG AGAAAACTGC ACTAGTATTT GGACAGATGA 150

ATGAACCACC AGGAGCGAGA ATGAGAGTAG CATTGACAGG ACTTACTATG 200

GCAGAATATT TTAGAGATCA AGGGCAAGAT GTACTTTTAT TTATAGATAA 250

TATATTTAGA TTTACGCAGG CTGGTTCTGA AGTTTCTGCA TTGTTAGGAA 300

GAATTCCAAG TGCAGTTGGA TATCAACCAA CCCTTGCAAC TGAAATGGGT 350

GCATTACAAG AAAGAATAAC ATCCACAAAA AATGGATCAA TTACTTCAG 399






2) INFORMATION FOR SEQ ID NO: 767 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus prevotii 

(B) STRAIN: ATCC 9321 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 767 



CATGATGGAA CTGATCCGTA ACATCGCCAT CGAGCACAGC GGTTATTCCG 50

TGTTCGCCGG TGTGGGTGAG CGTACTCGTG AGGGTAACGA CTTCTACCAC 100

GAGATGAAGG ACTCCAACGT TCTGGACAAA GTGGCACTGG TCTACGGTCA 150

GATGAACGAG CCGCCGGGTA ACCGTCTGCG CGTAGCACTG ACTGGCCTGA 200

CCATGGCCGA GAAGTTCCGT GACGAAGGTA ACGACGTTCT GTTGTTCGTC 250

GACAACATCT ACCGTTACAC CCTGGCCGGT ACTGAAGTAT CCGCACTGCT 300

GGGCCGTATG CCTTCGGCAG TAGGTTACCA GCCGACCCTG GCTGAAGAGA 350

TGGGCGTTCT GCAAGAACGT ATCACTTCGA CCAAGGAAGG 390
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2) INFORMATION FOR SEQ ID NO: 768 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 953 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Absidia corymbifera 

(B) STRAIN: ATCC 46775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 768 

AGGTCTTGTT CGTGGCAAAA GGTCATTGAC ACTGGTGCTC CTATCACCAT 50

TCCTGTTGGT AACGAAGTCC TTGGTCGTAT CATTAACGTC ATTGGTGAGC 100

CCATTGATGA GCGTGGTCCY ATCAAGTCCA AGGCCACTCG TGCTATCCAC 150

GCTGATGCTC CCGAGTTCGT TGATCAATCC CCCACTCCCG AGATTCTCGA 200

GACTGGTATC AAGGTTGTCG ATTTGCTTGC TCCCTATGCT CGTGGTGGTA 250

AGATTGGTCT TTTCGGTGGT GCCGGTGTCG GCAAGACTGT CTTGATTCAA 300

GAATTGATCA ACAACATTGC CAAGGCTCAC GGTGGTTACT CTATCTTCTG 350

TGGTGTCGGT GAACGTACTC GTGAAGGCAA CGATTTGTAC CACGAAATGA 400

TTTCCACTGG TGTCATCAAG CTTGAAGGTG AATCCAAGTG TGCTCTTGTC 450

TTTGGTCAAA TGAACGAACC CCCCGGAGCT CGTGCCCGTG TTGCCTTGAC 500

TGGTTTGACC ATTGCCGAAT ACTTCCGTGA TGAGGAAGGT CAAGATGTGT 550

TGCTCTTCAT TGACAACATT TTCCGTTTCA CTCAAGCCGG TTCTGAAGTG 600

TCCGCTTTGC TTGGTCGTAT TCCCTCTGCT GTCGGTTACC AACCCACTCT 650

YTCCACTGAT ATGGGTGGTA TGCAAGAGCG TATTACTACC ACCAAGAACG 700

GTTCCATTAC CTCCGTGCAA GCTGTCTACG TCCCTGCTGA CGATTTGACT 750

GATCCTGCTC CTGCTACTAC TTTTGCTCAC TTGGACGCCA CCACTGTGTT 800

GTCTCGTTCC ATTGCTGAGT TGGGTATCTA CCCTGCTGTC GATCCCCTTG 850

ACTCCAAGTC CCGTATCTTG GATCCCCGTA TCGTTGGTGA TGAGCACTAC 900

TCTGTTGCCA CTGGTGTCCA ACAAATCCTT CAAAACTACA AGTCGCTTCA 950

AGA 953



2) INFORMATION FOR SEQ ID NO: 769

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1343 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Alternaria alternata

(B) STRAIN: ATCC 62099

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 769



CGCGGTTCCA AGGCCACCGA CACCGGTGCC CCCATCAAGA TTCCCGTTGG 50 

TCACGGTACC CTTGGTCGTA TCATGAACGT CACTGGTGAC CCCATTGACG 100 

AGCGTGGTCC CATCAAGGCC ACCAAGTACG CTCCCATCCA CGCCGACCCC 150 

CCGGAGTTCA CCGAGCAATC CACCTCCGCT GAGGTCCTCG TTACCGGTAT 200

CAAGGTTGTC GACCTGTTGG CTCCTTACGC TCGTGGTGGA AAGATTGGTC 250

TCTTCGGAGG TGCTGGTGTC GGAAAGACTG TCTTCATTCA GGAGCTGATT 300 

GTAAGGAGAC ACACTGTCTA CTGGCTGAGC ATTAGCTAAC GGCAGGCAGA 350 

ACAACATCGC CAAGGCCCAC GGTGGTTTCT CTGTCTTCAC TGGTGTCGGT 400

GAGCGTACCC GTGAGGGTAA CGATCTGTAC CACGAGATGC AGGAGACTTC 450

CGTCATTCAG CTTGACGGTG ACTCCAAGGT CGCCCTCGTC TTCGGTCAGA 500

TGAACGAGCC CCCGGGTGCC CGTGCCCGTG TCGCTCTTAC TGGTCTTACT 550 

GTTGCTGAGT AAGTCTTGAA TTCACTGTGT TGACAACGTC GTGGCTAATG 600 

GGAAAAAAGA TACTTCCGTG ACGAAGAGGG ACAGGATGTG CTTCTCTTCA 650 

TCGACAGTAA GTGCTTGTAC GAACTGCCTG TGAGACATAC ACTGACTTCG 700 

GCAATAGACA TTTTCCGTTT CACCCAGGCC GGTTCCGAGG TGTCCGCTCT 750

TCTTGGTCGT ATCCCCTCTG CCGTCGGTTA CCAGCCCACC CTCGCCATTG 800 

ACATGGGTGT CATGCAGGAG CGTATTACCA CCACCACCAA GGGTTCCATC 850 

ACCTCCGTCC AGGCCGTCTA CGTGCCCGCT GACGATTTGA CTGACCCTGC 900 

CCCCGCCACC ACCTTCGCCC ATTTGGACGC CACCACTGTC TTGTCCCGTG 950 

GTATCTCCGA GTTGGGTATC TACCCCGCCG TCGACCCTCT TGACTCCAAG 1000

TCCCGTATGT TGGACCCCCG TGTCATTGGT CAGGACCACT ACGACACCGC 1050 

CACCCGCGTT CAGCAGATTC TCCAGGAGTA CAAGTCGCTC CAGGATATCA 1100 

TTGCCATTCT CGGTATGGAC GAGTTGTCGG AAGCTGACAA GCTTACCGTC 1150 

GAGCGTGCCC GTAAGATCCA GCGTTTCTTG AGCCAGCCTT TCGCTGTCGC 1200 

CCAGGTCTTC ACTGGTATTG AGGGCAAGCT TGTCGACCTC AAGGACACCA 1250

TCCGATCATT CAAGGCTATC TTGACTGGTG AGGGTGACGA CCTTCCCGAG 1300

GGTGAGTCTC GACTATCTCC GCATTCATAG CGTATAACTG ACA 1343



2) INFORMATION FOR SEQ ID NO: 770 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Aspergillus flavus

(B) STRAIN: ATCC 26947

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 770

TTCAGGAGTT GATTGTATGT TCACCTGCAA CATAAGACTT CCCATTCTCC 50

ACTCTTTTCT AACTCTTCAC AGAACAACAT TGCCAAGGCT CACGGTGGTT 100

ACTCTGTCTT CACTGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGATCTG 150

TACCACGAAA TGCAGGAGAC TGGTGTCATT CAGCTCGAGG GTGAATCTAA 200

GGTCGCCCTT GTCTTCGGTC AGATGAACGA GCCCCCAGGT GCCCGTGCCC 250

GTGTCGCCCT TACCGGTCTG ACCATCGCCG AGTACTTCCG TGACGAGGAA 300

GGTCAGGATG TGCTGCTCTT CATTGACAAC ATTTTCCGTT TCACCCAGGC 350

CGGTTCTGAG GTGTCTGCCC TTCTTGGTCG TATCCCCTCC GCTGTCGGTT 400

ACCAGCCCAC TCTGGCCGTC GACATGGGTG GTATGCAGGA GCGTATTACC 450

ACCACCACCA AGGGTTCCAT TACCTCCGTC 480



2) INFORMATION FOR SEQ ID NO: 771 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1174 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mucor circinelloides 

(B) STRAIN: ATCC 38592 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 771 

CTCGAACAAG AYAACTTGCC YGCCATTTTG AACGCCCTTG AAGTCAAGGA 50 

YCACTCTGGT GGACGTCTCG TYCTCGAAGT CTCTCAACAT TTGGGTGAGA 100 

ACACTGTCCG TACTATTGCT ATGGATGGTA CTGAAGGTAA GTTATGTYCA 150 

TCCCANNGGA TACAGTCARA CAGMAATGTC TAGTGGTTAT AGCAGYAGCA 200 

GATGATTGAC CAATATGTTA GGTCTTGTCC GGTGGTCAAA AGGTTGTTGA 250 

CACTGGTGCT CCCATTACCA TCCCCGTTGG TAAGGAAGTC CTTGGTCGTA 300 

TCATCAACGT TATTGGTGAA CCCATTGATG AACGTGGTCC CATTGACGCC 350 

AAGACTCACC GTCCTATTCA CGCTGAAGCT CCCGAATTCG TTGATCAATC 400

CCCCACTCCC GAAATCCTCG AGACTGGTAT CAAGGTYGTC GATTTGTTGG 450 

CTCCTTATGC TCGTGGTGGT AAGATTGGTC TCTTCGGTGG TGCTGGTGTC 500 

GGTAAGACTG TCTTGATTCA AGAATTGATT AACAACATYG CCAAGGCTCA 550

CGGTGGTTAC TCTATCTTCT GTGGTGTCGG TGAACGTACT CGTGAGGGTA 600

ACGATTTGTA CCATGAAATG ATTGAAACCG GTGTCATTCA ATTGGAAGGC 650

GAGTCCAAGT GTGCTCTCGT CTTTGGTCAA ATGAACGAAC CCCCAGGTGC 700

TCGTGCCCGT GTCGCTTTGA CTGGTTTGAC TATTGCTGAA TACTTCCGTG 750

ATGATGAGGG TCAAGATGTC TTGCTTTTCA TTGATAACAT TTTCAGATTC 800

ACTCAAGCTG GTTCTGAGGT ATCTGCCCTT TTGGGTCGTA TTCCTTCCGC 850

TGTCGGTTAC CAACCCACTC TTTCCACYGA TATGGGTGGT ATGCAAGAGC 900

GTATTACTAC CACCAAGAAC GGTTCCATTA CCTCCGTCCA AGCTGTCTAC 950

GTCCCTGCTG ATGATTTGAC CGATCCTGCT CCTGCCACCA CTTTTGCTCA 1000

CTTGGATGCC ACCACTGTCT TGTCTCGTTC CATCGCTGAA TTGGGTATCT 1050

AYCCCGCTGT CGATCCTCTT GATTCCAAGT CTCGTATCCT CGATCCCCGT 1100

ATTGTCGGTG ATGAGCACTA CAAGGTTGCC ACTGAAGTTC AACAAATCCT 1150

CCAAAACTAC AAGTCTCTCC AAGA 1174



2) INFORMATION FOR SEQ ID NO: 772

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 467 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Piedraia hortai

(B) STRAIN: ATCC 24292

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 772

AGGAGCTTAT CAACAACATC GCCAAGGCTC ACGGTGGTTA CTCCGTCTTC 50 

ACTGGTGTCG GTGAGCGTAC TCGTGAGGGT AACGATTTGT ACCACGAGAT 100

GCAAGAGACT TCCGTCATTC AGCTTGACSG CGAGTCCAAG GTCGCTCTCG 150 

TGTTCGGTCA GATGAACGAG CCCCCGGGTG CCCGTGCCCG TGTTGCCCTG 200 

ACTGGTCTTA CCATCGCTGA GTACTTCCGT GATGCCGAGG GTCAGGATGG 250 

TAAGTTCTAT AACTCTTGTC GCAAAGGTTT CATTCTGGTC GCTAACTTGC 300 

TCAGTGCTCC TGTTCATCGA CAACATTTTC CGTTTCACCC AGGCTGGTAT 350

GGAGGTGTCT GCCCTCCTCG GTCGTATTCC TTCTGCCGTC GGTTACCAAC 400

CTACTCTCGC CGTCGACATG GGTGGTATGC AAGAGCGTAT TACCACTACC 450 

AAGAAGGGAT CCATTAC 467 



2) INFORMATION FOR SEQ ID NO: 773 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 578 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudallescheria boydii 

(B) STRAIN: ATCC 44331

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 773

TTGCCAAGGC CCACGGTGGT TACTCTGTCT TCACTGGTGT CGGTGAGCGT 50

ACCCGTGAGG GTAACGATCT GTACCACGAA ATGCAGGAAA CCTCCGTCAT 100

TCAGCTTGAT GGCGAGTCCA AGGTCGCGCT TGTCTTCGGT CAGATGAACG 150

AGCCCCCTGG TGCCCGTGCT CGTGTCGCTC TTACTGGTCT TACCGTTGCC 200

GAGTACTTCC GTGATGAGGA GGGTCAGGAT GGTAAGTTAT ATCGTTTTTA 250

TTATCTTCTT TGCCACCACC CCTCTACGAA TCCATGCCTC CGTTGGTGAA 300

GGCATCGTTT GTAGGGCGGG TCGGAGTTTG CGGCAATTTC TGCCGTCGGC 350

TTGAAGCCGC GGATGCCCGA TGTTTGACGC GTATCGATGC TAACAACAAT 400

GACAACAGTG CTTCTCTTCA TTGACAACAT TTTCCGATTC ACCCAGGCCG 450

GTTCCGAGGT GTCTGCCCTT CTCGGTCGTA TTCCCTCTGC CGTCGGTTAC 500

CAGCCCACTC TTGCCGTAGA CATGGGTGCC ATGCAGGAAC GTATTACCAC 550

CACCAAGAAG GGTTCGATTA CCTCCGTC 578



2) INFORMATION FOR SEQ ID NO: 774 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1123 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhizopus oryzae 
(B) STRAIN: ATCC 56015

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 774 

AACTTACCYG CTATCTTGAA CGCTCTCGAA GTCCAAGATC ACTCTGGTGG 50 

ACGTCTTGTC CTTGAAGTTC GCTCAACACT TGGGTGAAAA TACTGTCCGT 100

ACTATTGCTA TGGATGGTAC TGAAGGTAAG CTATACTATA ACCGTKTTAT 150 

CCGAGTATGA TATTAACTTG AAAAAAGGTC TCGTYCGTGG TCAAAAGGTT 200 

ATTGACACTG GTGCTCCCAT TACCATTCCT GYTGGTAAGG AAGTTCTCGG 250 

TCGTATCATT AACGTCATTG GTGAACCCAT CGATGAACGT GGTCCTATCA 300 

ACGCCAAGAG CCAACGTCCC ATTCACGCCG AAGCTCCCGA ATTCGTTGAC 350

CAATCTCCTA CTCCCGAAAT TCTTGAAACT GGTATCAAGG TTGTCGACTT 400

GTTGGCTCCT TATGCTCGTG GTGGTAAGAT TGGTCTTTTC GGTGGTGCTG 450

GTGTCGGTAA GACTGTGTTG ATTCAAGAAT TGATTAACAA CATCGCCAAG 500 

GCTCACGGTG GTTACTCTAT TTTCTGTGGT GTCGGTGAAC GTACTCGTGA 550 

AGGTAACGAT CTTTACCACG AAATGATTGA AACTGGTGTC ATCAAGCTCG 600

ATGGTGACTC CAAGTGTGCT CTTGTCTTTG GTCAAATGAA CGAACCCCCA 650 

GGAGCTCGTG CCCGTGTTGC CTTGACTGGT TTGACCATTG CTGAATACTT 700 

CCGTGATGAT GAAGGTCAAG ATGTGTTGCT TTTCATTGAT AACATTTTCC 750 

GTTTCACCCA AGCTGGTTCW GAAGTATCTG CCCTTTTGGG TCGTATTCCC 800 

TCCGCTGTCG GTTACCAACC CACTCTTTCT ACTGATATGG GTGGTATGCA 850

AGAACGTATT ACAACCACCA AGAACGGTTC CATTACATCC GTCCAAGCTG 900 

TCTACGTCCC TGCTGATGAT TTGACCGATC CTGCTCCCGC CACCACTTTT 950 

GCTCACTTGG ATGCCACCAC TGTCTTGTCT CGTTCCATTG CCGAATTGGG 1000 

TATTTACCCT GCCGTCGATC CTCTTGAYTC CAAGTCTCGT ATCTTGGATC 1050 

CTCGTATCGT TGGTGACGAA CACTACAAGG TCGCTACCGA AGTTCAACAA 1100

ATCCTTCAAA ACTACAAGTC TCT 1123 



2) INFORMATION FOR SEQ ID NO: 775

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 477 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Scopulariopsis koningii 

(B) STRAIN: ATCC 38745 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 775



ATTCAGGAGC TCATCAACAA CATTGCCAAG GCTCACGGTG GTTACTCTGT 50 

GTTCACTGGT GTCGGTGAGC GTACCCGTGA GGGTAACGAT CTGTACCACG 100 

AAATGCAGGA GACTTCGGTC ATTCAGCTCG AGGGCGAGTC CAAGGTCGCG 150 

CTTGTGTTCG GTCAGATGAA CGAGCCCCCC GGTGCCCGTG CCCGTGTCGC 200 

CCTTACCGGT CTGACCGTTG CCGAGTACTT CCGTGACGAG GAGGGCCAGG 250

ATGGTGAGTA ACCGACGAAG TCTGAGATCT TGTCGGGCAT TATTCTAACG 300 

ACAACTAGTG CTTCTCTTCA TCGACAACAT TTTCCGCTTC ACCCAGGCCG 350 

GTTCCGAGGT GTCCGCGCTT CTCGGCCGTA TCCCCTCTGC CGTCGGTTAC 400 

CAGCCCACCC TGGCCGTCGA CATGGGAGGT ATGCAGGAGC GTATTACCAC 450 

GACTCAGAAG GGCTCGATTA CCTCGGT 477



2) INFORMATION FOR SEQ ID NO: 776 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Trichophyton mentagrophytes

(B) STRAIN: ATCC 8125

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 776 

TCCGGAGTTG ATTGTAAGTC ATTTGAAACC CAGCCCCAAG AAACAGAAGC 50

TAGGCGAAAA TTGGACAATT GAGCAATTTA GCCATTGGAG AAAAGAAATT 100 

TCGAGTATTA ATTGTTTTTA TAGAACAACA TTGCCAAGGC TCACGGTGGT 150 

TACTCTGTCT TCACTGGTGT CGGAGAGCGT ACCCGTGAAG GAAACGATCT 200 

CTACCATGAA ATGCAGGAGA CCCGTGTCAT TCAGCTTGAT GGCGAGTCCA 250 

AGGTCGCCCT GGTCTTCGGC CAGATGAACG AGCCCCCAGG TGCCCGTGCC 300

CGTGTTGCTC TTACTGGTTT GACCATTGCT GAGTACTTCC GTGATGAGGA 350 

AGGTCAAGAC GGTGAGTTTY TTATGGATAA AARATTTTTT TTTTTTTTTT 400 

TTTTTTMAAR AAATTCATGT TCTAACAAAG TATATCCTAG TGCTTCTCTT 450 

CATCGACAAC ATTTTCCGTT TCACTCAGGC TGGTTCCGAA GTGTCTGCCC 500 

TGCTTGGTCG TATCCCATCT GCCGTCGGTT ACCAACCCAC TCTTGCCGTC 550

GACATGGGTG GTATGCAGGA ACGTATTACC ACCACCAAGA AGGGATCCAT 600 

TACCTCCGTC 610 



2) INFORMATION FOR SEQ ID NO: 777

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 593 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton tonsurans 

(B) STRAIN: ATCC 56185 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 777

GGAGTTGATT GTAAGTCATT TGAAACCCAG CCCCAAGAAA CAGAAGCTAG 50 

GTGAAAATTG GACAATTGAA CAATTTAGCC CTTGGAGAAA AGAAATTTCG 100 

AGTATTAATT ATTTTTATAG AACAACATTG CCAAGGCTCA CGGTGGTTAC 150 

TCTGTCTTCA CTGGTGTCGG AGAGCGTACC CGTGAAGGAA ACGATCTCTA 200 

CCATGAAATG CAGGAGACCC GTGTCATTCA GCTTGATGGC GAGTCCAAGG 250 

TCGCCCTGGT CTTCGGCCAG ATGAACGAGC CCCCAGGTGC CCGTGCCCGT 300 

GTTGCTCTTA CTGGTTTGAC CATTGCTGAG TACTTCCGTG ATGAGGAAGG 350 

TCAAGACGGT GAGTTTCTTA TGGATGAAAG ATTTTTTTTT TTCAAGAAAT 400

TCATGTTCTA ACAAAGTGTA TTCTAGTGCT TCTCTTCATC GACAACATTT 450 

TCCGTTTCAC TCAGGCTGGT TCCGAAGTGT CTGCCCTGCT TGGTCGTATC 500 

CCATCTGCCG TCGGTTACCA ACCCACTCTT GCCGTCGACA TGGGTGGTAT 550 

GCAGGAACGT ATTACCACCA CCAAGAAGGG ATCCATTACC TCC 593 



2) INFORMATION FOR SEQ ID NO: 778 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichosporon cutaneum 

(B) STRAIN: ATCC 62965 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 778 

CCGTGGTCAA GAAGTTATTG ACACTGGTGC CCCAATTACC ATTCCTGTTG 50 

GTCGTGGTAC TCTTGGTAGA ATTATCAACG TCATTGGTGA ACCAATTGAC 100 

GAACGTGGCC CTATCAAGGC TTCTAAGTAT GCTCCTATCC ATACTGAACC 150 

ACCAACCTTT GCTGAACAAT CTACTTCTGC TGAAGTTCTT GAAACCGGTA 200 

TCAAGGTTGT CGATCTTCTT GCTCCTTACG CCCGTGGTGG TAAGATTGGT 250 

CTTTTCGGTG GTGCTGGTGT CGGTAAGACT GTCTTCATTC AAGAACTTAT 300 

TAACAACATT GCCAAGGCTC ACGGTGGTTT CTCTGTCTTC ACTGGTGTCG 350 

GTGAAAGAAC CCGTGAAGGT AACGATCTTT ACCGTGAAAT GAAGGAAACT 400

GGTGTCATCA ACCTCGAAGG TGACTCCAAG GTCGCTCTCG TTTTCGGTCA 450 

AATGAACGAA CCTCCAGGTG CCCGTGCCCG TGTCGCTTTG ACTGGTCTTA 500 

CCATTGCCGA ATACTTCCGT GATGAAGAAG GACAAGATGT CTTGCTTTTC 550 

GTTGACAACA TTTTCAGATT CACCCAAGCC GGTTCTGAAG TCTCTGCTCT 600 

TTTGGGTCGT ATTCCATCTG CCGTCGGTTA CCAACCTACC CTTGCTACCG 650

ATATGGGTGC CCTCCAAGAA CGTATTACCA CCACCCAAAA gggtt6cgtc 700

ACATCTGTCC AAGCCGTCTA TGTCCCAGCA GACGATTTGA CTGATCCTGC 750

CCCAGCCACC ACTTTCGCTC ACTTGGACGC CACCACTGTC TTGTCTCGTT 800

CCATTTCCGA ATTGGGTATC TACCCAGCTG TCGATCCTCT CGATTCCAAG 850

TCTCGTCTTT TGGATCCTGA AGTTATTGGA CACGAACACT ACGAAGTTGC 900

CACTCAAGTT CAACAAACCC TCCAAGCTTA CAAGTCTCTC CAAGATATCA 950

TTGCCATTTT GGGTATGGAT GAATTGTCTG AAGCTGATAA GCTTACTGTC 1000

GAACGTGCCC GTAAGATCCA AAGATTCCTT TCCCAACCAT TCGCTGTTGC 1050

CGAAGTTTTC ACTGGTATCG AAGGTCGTCT CGTTCCATTG AAGGAAACCG 1100

TCAGATCTTT CAAGGAAATC CTTGAAGGTA AGTACGATCA C 1141



2) INFORMATION FOR SEQ ID NO: 779

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1093 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Cladophialophora carrionii

(B) STRAIN: ATCC 22864 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 779 

GCTGAAGGCC GAACGTGAGC GTGGTATCAC CATCGATATC GCGCTCTGGA 50

AGTTCGAGAC TCCCAAGTAC TTCGTCACCG TCATCGATGC CCCTGGTCAT 100 

CGTGACTTCA TCAAGAACAT GATCACTGGT ACCTCCCAGG CTGATTGTGC 150 

TATTCTCATC ATTGCCGCTG GTACTGGTGA GTTCGAGGCC GGTATCTCCA 200 

AGGATGGCCA GACCCGTGAG CATGCTCTGC TCGCCTACAC CCTGGGCGTG 250 

AAGCAGCTTA TCGTCGCCAT CAACAAGATG GACACCACCA AATGGTCTGA 300

GGATCGTTTC AACGAAATCA TCAAGGAGAC TTCCAACTTC ATCAAGAAGG 350 

TCGGATACAA CCCCAAGTCC GTTCCATTCG TGCCCATCTC CGGCTTCAAC 400

GGTGACAACA TGATCGACGT CTCCACCAAT GCCCCCTGGT ACAAGGGCTG 450

GGAAAAGGAG TCCAAGGCTG GCAAGGCCAC CGGCAAGACC CTCCTTGAGG 500 

CTATCGACTC CATCGACCCT CCTGCTCGTC CCACCGACAA GCCTCTCCGT 550

CTCCCACTCC AGGATGTCTA CAAGATTTCT GGTATCGGCA CGGTGCCCGT 600

CGGTCGTGTT GAGACTGGTA CCATCAAGGC CGGTATGGTC GTCACCTTTG 650 

CCCCCGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAAAT GCACCACGAA 700 

CAGCTCGCCG AGGGCGTTCC GGGTGACAAC GTCGGCTTCA ACGTCAAGAA 750 

CGTCTCCGTG AAGGAGGTTC GTCGTGGAAA CGTTGCTGGT GACTCCAAGA 800

ACGACCCCCC CAAGGGTGCC GACTCCTTCA ACGCCCAGGT CATCGTCCTC 850 

AACCACCCTG GTCAGGTCGG TGCTGGCTAC GCCCCGGTCT TGGATTGCCA 900 

CACTGCCCAC ATTGCCTGCA AGTTCTCTGA GCTCCTCGAG AAGATCGATC 950 

GTCGTACCGG CAAGTCCATG GAAAACAACC CCAAGTTCAT CAAGTCTGGT 1000 

GATGCTGCCA TCGTGAAGAT GGTTCCCAGC AAGCCTATGT GCGTTGAGGC 1050

TTTCACCGAC TACCCTCCTC TTGGTCGTTT CGCCGTCCGT GAC 1093 



2) INFORMATION FOR SEQ ID NO: 780

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cunninghamella bertholletiae 

(B) STRAIN: ATCC 42115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 780 



TACTTGTAAA TGGTCTCAAG ATCGTTACAA CGAAATTGTT AAGGAAGTTT 50 

CTTCCTTCAT CAAGAAGATT GGTTACAATC CTAAATCCGT TCCTTTCGTY 100 

CCTATCTCTG GTTGGCACGG TGATAACATG TTGGAAGCTT CTACCAACAT 150 

GCCTTGGTAC AAGGGATGGA CCAAGGAAAC TAAAGCTGGT TCTTCCACTG 200 

GTAAGACTCT CTTAGAAGCC ATTGACAGCA TTGAACCTCC TACCCGTCCT 250

TCTGACAAGC CTTTACGTCT TCCTTTACAA GATGTTTACA AGATTGGTGG 300 

TATTGGTACT GTCCCTGTTG GTCGTGTTGA AACTGGTGTC ATCAAGGCTG 350 

GTATGGTTGT TACYTTCGCT CCCGCTAACG TCACCACTGA AGTTAAGTCC 400

GTTGAAATGC ATCACGAACA ATTAGAACAA GGTGTTCCTG GTGACAACGT 450 

TGGTTTCAAC GTCAAGAACG TTTCCGTTAA GGATATCCGT CGTGGTAACG 500

TCTGTTCCGA CTCCAAGAAC GACCCCGCTA AGGAATCTGC TTCCTTCAAC 550 

GCTCAAGTTA TCGTCTTGAA CCACCCTGGT CAAATTGGTG CTGGTTATGC 600 

CCCAGTTCTT GACTGTCACA CTGCTCACAT TGCTTGTAAG TTCGCTGAAT 650 

TATTAGAAAA GATCGATCGT CGTTCCGGTA AGAAACTCGA AGATGCTCCT 700 

AAATTCGTTA AATCTGGTGA CTCTGCTATC GTTAAGATGG TTCCTTCCAA 750

GC 752 



2) INFORMATION FOR SEQ ID NO: 781

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 728 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Curvularia lunata 

(B) STRAIN: ATCC 26425 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 781



CAAGTGGTCT GAGGACCGTT ACCAGGAAAT CATCAAGGAG ACCTCCAACT 50 

TCATCAAGAA GGTCGGCTAC AACCCCAAGC ACGTTCCCTT CGTCCCCATC 100 

TCCGGTTTCA ACGGAGACAA CATGATTGAG GCCTCCACCA ACTGCCCCTG 150 

GTACAAGGGT TGGGAGAAGG AGACCAAGGC CAAGGCCACT GGTAAGACCC 200 

TCCTTGAGGC CATCGACGCC ATCGACCCTC CTGTCCGTCC TACCGACAAG 250

CCCCTCCGCC TTCCCCTCCA GGATGTTTAC AAGATTGGTG GTATTGGCAC 300 

GGTCCCCGTC GGTCGTGTCG AGACCGGTAT CATCAAGCCC GGTATGGTCG 350 

TCACCTTCGC CCCCGCTGGT GTCACCACCG AAGTCAAGTC CGTCGAGATG 400

CACCACGAGC AGCTTACTGA GGGTGTCCCC GGTGACAACG TCGGCTTCAA 450

CGTCAAGAAC GTCTCCGTCA AGGAGATCCG TCGTGGTAAC GTTGCCGGTG 500

ACTCCAAGAA CGACCCCCCC AAGGGTTGCG AGTCCTTCAA CGCCCAGGTC 550 

ATCGTCCTCA ACCACCCTGG TCAGGTCGGT GCCGGTTACG CCCCAGTCCT 600 

TGACTGCCAC ACTGCCCACA TTGCCTGCAA GTTCTCCGAG CTCCTCGAGA 650 

AGATCGACCG CCGTACCGGA AAGTCTGTTG AGAACTCCCC CAAGTTCATC 700 

AAGTCCGGTG ACGCCGCCAT CGTCAAGA 728



2) INFORMATION FOR SEQ ID NO: 782

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 782 



TGAAGTCCGA GCGTGAGCGT GGTATCACCA TPPATATPflP PPTPTCfiAAG 50

TTCGAGACTC CCAAGTAAGG CTCAACAGAC Z1PAAPAAPPA AATPPATAPT 100

CGCTAACCTA TTCACCCACC ACAGGTACAA CGTCACCGTC ATTGACGCCC 150

CCGGTCACCG TGATTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCT 200

GACTGCGCCA TTCTCATCAT TGCCGCCGGT ACTGGTGAGT TCGAGGCCGG 250

TATCTCCAAG GACGGTCAGA CCCGTGAGCA CGCTCTTCTC GCCTACACCC 300

TTGGTGTCAA GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAG 350

TGGTCTGAGG CCCGTTACCA GGAGATCATC AAGGAGACCT CCGGTTTCAT 400

CAAGAAGGTC GGCTTCAACC CCAAGCACGT TCCCTTCGTG CCCATCTCCG 450

GTTTCAACGG TGACAACATG ATCGACGTCT CCACCAACTG CCCCTGGTAC 500

AAGGGTTGGG AGAAGGAGAC CAAGGCCAAG GCCACCGGCA AGACCCTCCT 550

CGAGGCCATT GACGCCATCG ACCCCCCCAC TCGTCCCACC GACAAGCCCC 600

TCCGTCTTCC CCTYCAGGAT GTCTACAAGA TCGGTGGTAT TGGCACGGTT 650

CCCGTCGGTC GTGTCGAGAC CGGTACCATC AAGGCCGGCA TGGTCGTCAC 700

CTTCGCCCCC GCTGGTGTCA CCACTGAGGT CAAGTCCGTC GAGATGCACC 750

ACGAGCAGCT TCCCGAGGGT CTCCCCGGTG ACAACGTCGG CTTCAACGTC 800

AAGAACGTCT CCGTCAAGGA GATCCGTCGT GGCAACGTCG CCGGTGACTC 850

CAAGAACGAC CCCCCCAAGG GCTGCGACAG CTTCAACGCC CAGGTCATCG 900

TCCTCAACCA CCCCGGTCAG GTCGGCGCCG GCTACGCGCC CGTCCTCGAC 950

TGCCACACTG CTCACATTGC TTGCAAGTTC TCTGAGCTCC TCGAGAAGAT 1000

CGACCGCCGT ACCGGCAAGT CCATTGAGGC CAGCCCCAAG TTCATCAAGT 1050

CTGGTGACGC CGCCATCGTC AAGATGGTTC CCTCCAAGCC TATGTGCGTT 1100

GAGGCCTTCA CCGACTACCC CCCTCTTGGA CGTTTCGCCG TCCGT 1145



2) INFORMATION FOR SEQ ID NO: 783 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1151 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum audouinii 

(B) STRAIN: ATCC 11347 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 783

GCTCAAGGCT GAGCGTGAGC GTGGTATCAC CATTGACATC GCCCTCTGGA 50

AGTTCGAGAC CCCCAAGTAC ATGGTCACCG TCATCGGTAT GCTTTATCTG 100

TTTCCCATTT ATAGTTGCGA CAAGTAACTA ATAAAAAGTA GATGCCCCCG 150

GACACCGTGA CTTCATCAAG AACATGATTA CTGGTACCTC CCAGGCCGAC 200

TGCGCTATTC TCATCATTGC TGCCGGTACT GGTGAGTTCG AGGCTGGTAT 250

CTCCAAGGAT GGCCAGACTC GTGAGCACGC CCTGCTCGCT TTCACCCTCG 300

GTGTCAAGCA GCTCATCGTT GCCATCAACA AGATGGACAC CACCAACTGG 350

TCTGAGTCCC GTTTCGGTGA AATCATCAAG GAAGTCACCA ACTTCATCAA 400

GAAGGTCGGC TACGACCCCA AGGGTGTCCC ATTCGTCCCA ATCTCTGGCT 450

TCAACGGTGA CAACATGATT GAGCCCTCCA CCAACTGCCC ATGGTACAAG 500

GGATGGAACA AGGAGACCAA GGCCGGTGGC AAATCCTCTG GTAAGACCCT 550

CCTTGAGGCC ATCGATGCCA TTGACATGCC CACTCGTCCC ACCGACAAGC 600

CTCTCCGTCT CCCACTCCAG GATGTCTACA AGATCTCTGG TATCGGAACA 650

GTACCAGTCG GTCGTGTTGA GACTGGTATC ATCAAGCCTG GTATGGTTGT 700

CACTTTCGCC CCCGCCAACG TCACCACTGA AGTCAAGTCC GTCGAAATGC 750

ACCACCAGCA GCTCGTTCAG GGTGTTCCCG GTGACAACGT TGGCTTCAAC 800

GTCAAGAACG TCTCTGTCAA GGAAGTCCGC CGTGGTAACG TTGCCGGTGA 850

TTCCAAGAAC GACCCACCAT CTGGCTGCGC CTCTTTCAAG GCCCAGGTCA 900

TCGTCCTCAA CCACCCCGGC CAGATCGGTG CTGGTTACGC CCCAGTCCTC 950

GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC TTCTTGAGAA 1000

GATTGACCGC CGTACTGGTA AATCCGTCGA AACCAGCCCT AAGTTCGTCA 1050

AGTCTGGTGA TGCCGCTATT GCCACCATGG TTCCATCCAA GCCCATGTGC 1100

GTTGAGGCTT TCACTGACTA CCCACCACTT GGTCGTTTCG CCGTCCGTGA 1150

C 1151



2) INFORMATION FOR SEQ ID NO: 784 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 979 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mucor circinelloides 

(B) STRAIN: ATCC 38592 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 784 

CCAAGTGGTC TCAAGATCGT TACAACGAAA TTGTCAAGGA AGTTTCCGGT 50 

TTCATCAAGA AGATCGGTTT CAACCCCAAG TCCGTTCCTT TCGTTCCCAT 100 

TTCTGGCTGG CACGGTGATA ACATGTTGGA TGAATCCACC AACATGCCCT 150 

GGTTCAAGGG ATGGAACAAG GAGACCAAGG CCGGTTCCAA GACTGGTAAG 200 

ACTCTCCTCG AAGCCATCGA TGCCATTGAG CCCCCTGTCC GTCCTTCTGA 250

CAAGCCTCTC CGTCTTCCTC TTCAAGATGT CTACAAGATT GGTGGTATTG 300 

GTACAGTTCC CGTCGGTCGT GTTGAAACTG GTACTATCAA GGCTGGTATG 350 

GTTGTCAACT TCGCTCCCGC TGCTGTCACC ACTGAAGTTA AGTCCGTTGA 400 

AATGCATCAC GAAACCCTCT CTGAAGGTCT CCCCGGTGAC AACGTTGGTT 450

TCAACGTCAA GAACGTCTCC GTCAAGGATA TCCGTCGTGG TAACGTCTGT 500 

TCCGACTCCA AGAACGATCC CGCTAAGGAA TCTGCCTCTT TCACTGCTCA 550 

AGTTATTATC TTGAACCATC CCGGTCAAAT CTCTGCTGGT TACGCACCAG 600 

TTCTCGATTG TCACACTGCT CACATCGCCT GTAAGTTCTC TGAACTCATT 650 

GAGAAGATTG ATCGTCGTTC CGGTAAGTAC CTGCATCTGT CAGAATTGAA 700 

GGTCCGCCGT TATAGCAAAG GCTGGGTTTA AATGTTGGGG TTTGTCTGAT 750 

CTATAATGAT GATTGCTCCT TCAATTTTTG ACATAATTTG ATGATCTGAA 800

TTGTGTTGCT AACGTCGCAT TTGCTTCTTT TGCTTCTTTT GCATGTAGGT 850

AAGAAGATGG AAGATGCTCC CAAAGTAAGT ATTACGATTG ATGGACAATT 900

AAAATAGAAT ACTAACAATT ATTGTTTATA GTTCGTCAAG TCTGGTGACT 950

CTGCTATCGT CAAGATGGTT CCCTCCAAG 979



2) INFORMATION FOR SEQ ID NO: 785 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Phialophora verrucosa

(B) STRAIN: ATCC 38561

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 785 

GGACAAGCTG AAGGCCGAAC GTGAGCGTGG TATCACCATC GATATCGCGC 50 

TCTGGAAGTT CGAGACTCCC AAATACTTCG TCACCGTCAT TGATGCCCCT 100

GGTCATCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 150

TTGTGCCATC CTCATCATTG CCGCCGGTAC CGGTGAGTTC GAAGCCGGTA 200

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTCCTCGC CTACACCCTA 250

GGTGTGAAGC AGCTTATCGT CGCCATCAAC AAGATGGACA CCGCCAAATG 300

GTCCGAGGAT CGGTTCAACG AAATCATCAA GGAGACTTCC AACTTCATCA 350

AGAAGGTCGG ATACAACCCC AAGTCCGTCC CGTTCGTGCC CATCTCCGGT 400

TTCAACGGTG ACAACATGAT CGACGTCTCC TCCAACGCCC CCTGGTACAA 450 

GGGTTGGGAG AAGGAGACCA AGGCCGGCAA GGCCACTGGC AAGACCCTCC 500 

TCGAGGCCAT CGACGCGATT GACCCTCCTA CTCGTCCCAC CGACAAGCCC 550 

CTCCGTCTCC CTCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGCACGGT 600

GCCCGTTGGT CGTGTTGAGA CCGGTACCAT CAAAGCCGGT ATGGTCGTCA 650 

CCTTCGCTCC CGCCAACGTC ACCACTGAAG TCAAGTCCGT CGAAATGCAC 700 

CACGAACAGC TCGCCGAAGG TGTTCCAGGT GACAATGTCG GCTTCAACGT 750 

CAAGAACGTC TCCGTCAAGG AGGTTCGTCG TGGAAACGTT GCCGGTGACT 800 

CCAAGAATGA CCCCCCCAAG GGTGCCGACT CCTTCAACGC TCAGGTCATC 850

GTCCTCAACC ACCCTGGTCA GGTCGGTGCC GGCTACGCCC CGGTCTTGGA 900 

TTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTC CTCGAGAAGA 950 

TCGATCGTCG TACCGGCAAG TCCATGGAAA ACAACCCCAA GTTTATCAAG 1000 

TCTGGTGATG CTGCCATCGT GAAGATGGTT CCCAGCAAGC CTATGTGCGT 1050 

TGAGGCCTTC ACCGACTATC CTCCTCTTGG TCGTTTCGCC GTCCGTGAC 1099



2) INFORMATION FOR SEQ ID NO: 786 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Saksenaea vasiformis

(B) STRAIN: ATCC 60625

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 786

ACCACCAAGT GGTCTGAGGC CCGTTACCAG GAGATCATCA AGGAGACCTC 50

CGGYTTCATC AAGAAGGTCG GCTTCAACCC CAAGCACGTT CCCTTCGTGC 100

CCATCTCCGG TTTCAACGGT GACAACATGA TCGACGTCTC CACCAACTGC 150

CCCTGGTACA AGGGTTGGGA GAAGGAGACC AAGGCCAAGG CCACCGGCAA 200

GACCMTCCTC GAGGCCATTG ACGCCATCGA CCCCCCYAGY CGTCCCACCG 250

ACAAGCCCCT YCGTCTTCCC CTMCAGGATG TYTACAAGAT TGGCGGTATT 300

GGCACGGTTC CCGTCGGTCG TGTYGAGACC GGTRCCATCA AGGGTGGCAT 350

GGTCGTCACC TTCCCCCCCG CTGGTGTCAC CACTGAGGTC AAGTCCGTCG 400

AGATGCACCA CGAGCAGCTC GCCGAGGGTS TCCCCGGTGA CAACGTCGGC 450

TTCAACGTCA AGAACGTCTC CGTCAAGGAG ATCCGTCGTG GCAACGTTGC 500

CGGTGACTCC AAGAACGACC CCCCCAAGGG CTGCGACAGC TTCAACGCCC 550

AGGTCATCGT CCTCAACCAC CCCGGTCAGG TCGGCGCCGG CTACGCGCCS 600

GTCCTSGACT GCCACACTGC TCACATTGCC TGCAAGTTCT CTGAGCTCCT 650

CGAGAAGATC GACCGCCGTT CCGGCAAGTC CATCGAGTCC GGCCCCAAGT 700

TCATCAAGTC TGGTGACGCC GCCATCGTCA AGATGGTTCC CTCCAAGCCC 750



2) INFORMATION FOR SEQ ID NO: 787 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1084 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Syncephalastrum racemosum

(B) STRAIN: ATCC 32330 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 787 

GCTCAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCTCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC CACGTCACCG TCATTGATGC CCCCGGCCAT 100 

CGTGATTTCA TCAAGAACAT GATCACTGGT ACCTCGCAGG CTGACTGCGG 150 

TATCCTCATC ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 200 

AGGATGGCCA GACCCGTGAG CACGCTCTGC TTGCCTTCAC CCTCGGTGTC 250 

CGTCAGCTGA TCGTCGCCAT CAACAAGATG GACTCGACCA AGTACTCTGA 300 

GGCCCGTTAC AACGAAATCG TCAAGGAGGT CTCCACCTTC ATCAAGAAGA 350

TCGGTTTCAA CCCCAAGTCC GTTCCCTTCG TCCCCATCTC TGGCTGGAAC 400

GGTGACAACA TGTTGGAGGA GTCCTCCAAC ATGCCCTGGT TCAAGGGCTG 450

GAAGAAGGAG ACCAAGGCTG GCGAGAAGTC CGGCAAGACC CTCCTTGAGG 500 

CCATTGACAA CATTGACCCC CCGGTCCGTC CCTCGGACAA GCCCCTCCGT 550 

CTTCCCCTCC AGGATGTCTA CAAGATCGGT GGTATCGGCA CAGTCCCCGT 600 

CGGTCGTGTC GAGACTGGTG TCATCAAGGC TGGTATGGTC GTGACCTTCG 650 

CCCCCGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAGAT GCACCACGAG 700 

CAGCTCGTCG AGGGTGTCCC CGGTGACAAC GTCGGTTTCA ACGTCAAGAA 750 

CGTTTCCGTC AAGGATATCC GCCGTGGTAA CGTCTGCTCT GACTCCAAGA 800 

ACGACCCCGC CAAGGAGTCT GCCTCGTTCA CCGCCCAGGT CATCGTCCTG 850 

AACCACCCCG GTCAGATCGG TGCCGGTTAC GCCCCGGTTC TTGACTGCCA 900 

CACCGCTCAC ATTGCCTGCA AGTTCGCTGA GCTCCTCGAG AAGATCGACC 950 

GTCGTTCCGG YAAGAAGCTC GAAGAGTCCC CCAAGTTCGT CAAGTCGGGT 1000 

GACTCCGCCA TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GCGTTGAGGC 1050 

CTACACTGAG TACCCCCCTC TTGGCCGTTT CGCC 1084



2) INFORMATION FOR SEQ ID NO: 788


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Trichophyton tonsurans

(B) STRAIN: ATCC 56185 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 788 

GCTCAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50

AGTTCGAGAC CCCCAAGTAC AATGTCACCG TCATTGGTAT GTTTTTCTTT 100 

ACCTTTCCCC TCCATCGTCT TGCTGTGCCA TAACTAACGA GAGTAGACGC 150 

CCCCGGTCAC CGTGACTTCA TCAAGAACAT GATCACTGGT ACCTCCCAGG 200 

CTGACTGTGC TATTCTCATC ATTGCTGCCG GTACTGGTGA GTTCGAGGCT 250 

GGTATCTCCA AGGATGGCCA GACCCGTGAG CACGCTCTGC TCGCCTTCAC 300

CCTTGGTGTC AAGCAGCTCA TCGTTGCCAT CAACAAGATG GACACCACCA 350 

ACTGGTCCGA GGACCGTTTC AAGGAAATCA TCAAGGAAGT CACCAACTTC 400 

ATCAAGAAGG TTGGCTACGA CCCCAAGGGT GTTCCATTCG TTCCAATCTC 450

TGGTTTCAAC GGTGACAACA TGATTGAGGC CTCCAGCAAC TGCCCATGGT 500 

ACAAGGGATG GAACAAGGAG ACCAAGGCCG GTGGTGCCAA GACTGGCAAG 550

ACCCTYCTCG AGGCCATCGA TGbCATCGAC ATGCCAACCC GTCCTACCGA 600 

CAAGCCCCTY CGTCTCCCAC TCCAGGATGT CTACAAGATC TCTGGTATCG 650 

GAACTGTACC AGTCGGTCGT GTTGAGACCG GTATCATCAA GCCTGGTATG 700 

GTCGTCACCT TCGCCCCTGC CAACGTCACC ACTGAAGTCA AGTCCGTCGA 750 

AATGCACCAC CAGCAGCTTC AGCAGGGTGT CCCCGGTGAC AACGTCGGCT 800

TCAACGTCAA GAACGTTTCC GTCAAGGAAG TCCGCCGTGG TAACGTTGCC 850 

GGTGACTCCA AGAACGACCC ACCATCCGGC TGTGCCTCCT TCAACGCCCA 900 

GGTCATCGTC CTCAACCACC CCGGCCAGAT CGGTGCTGGT TACGCCCCAG 950 

TCCTCGACTG CCACACTGCT CACATTGCTT GCAAGTTCGC TGAGCTCCTC 1000 

GAGAAGATTG ACCGCCGTAC CGGTAAATCC GTCGAAGCCA ACCCCAAGTT 1050

CGTCAAGTCT GGTGATGCCG CTATCGCCAA GATGGTTCCC TCCAAGCCCA 1100 

TGTGCGTTGA GGCTTTCACT GACTACCCCC CACTTGGTCG TTTCGCCGTC 1150 

CGTGA 1155 



2) INFORMATION FOR SEQ ID NO: 789 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1138 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 789

TCAAGGCCGA GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG 50

TTCGAGACCC CCAAGTACAA TGTCACCGTC ATTGGTATGT TTCTCTTTAC 100

CTTTCCCCTC CATCGTCTTG CTGTGCCATA ACTAACGAGA GTAGACGCCC 150

CCGGTCACCG TGACTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCT 200

GACTGTGCTA TTCTCATCAT TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG 250

TATCTCCAAG GATGGCCAGA CCCGTGAGCA CGCTCTGCTC GCCTTCACCC 300

TTGGTGTCAA GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAC 350

TGGTCCGAGG ACCGTTTCAA GGAAATCATC AAGGAAGTCA CCAACTTCAT 400

CAAGAAGGTT GGCTACGACC CCAAGGGTGT TCCATTCGTT CCAATCTCTG 450

GTTTCAACGG TGACAACATG ATTGAGGCCT CCACCAACTG CCCATGGTAC 500

AAGGGATGGA ACAAGGAGAC CAAGGCCGGT GGTGCCAAGA CTGGCAAGAC 550

CCTCCTCGAG GCCATCGATG CCATCGACAT GCCAACCCGT CCTACCGACA 600

AGCCCCTCCG TCTCCCACTC CAGGATGTCT ACAAGATCTC TGGTATCGGA 650

ACTGTACCAG TCGGTCGTGT TGAGACCGGT ATCATCAAGC CCGGTATGGT 700

CGTCACCTTC GCCCCTGCCA ACGTCACCAC TGAAGTCAAG TCCGTCGAAA 750

TGCACCACCA GCAGCTTCAG CAGGGTGTCC CCGGTGACAA CGTCGGCTTC 800

AACGTCAAGA ACGTTTCCGT CAAGGAAGTC CGCCGTGGTA ACGTTGCCGG 850

TGACTCCAAG AACGACCCAC CATCCGGCTG TGCCTCCTTC AACGCCCAGG 900

TCATCGTCCT CAACCACCCC GGCCAGATCG GTGCTGGTTA CGCCCCAGTC 950

CTCGACTGCC ACACTGCTCA CATTGCTTGC AAGTTCGCTG AGCTCCTCGA 1000

GAAGATTGAC CGCCGTACCG GTAAATCCGT CGAAGCCAAC CCCAAGTTCG 1050

TCAAGTCTGG TGATGCCGCT ATCGCCAAGA TGGTTCCATC CAAGCCTATG 1100

TGCGTTGAGG CTTTCACTGA CTACCCCCCA CTTGGTCG 1138



2) INFORMATION FOR SEQ ID NO: 790 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Bipolaris hawaiiensis

(B) STRAIN: ATCC 26067 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790 

CACCACCAAG TGGTCTGAGG AGCGTTACCA GGAAATCATC AAGGAGACCT 50

CCAACTTCAT CAAGAAGGTC GGCTACAACC CCAAGCACGT TCCCTTCGTC 100 

CCCATCTCTG GTTTCAACGG AGACAACATG ATTGAGGCTT CCACCAACTG 150 

CCCCTGGTAC AAGGGTTGGG AGAAGGAGAC CAAGGCCAAG GCCACTGGTA 200 

AGACCCTTCT CGAGGCCATC GACGCCATCG ACCCCCCTGT CCGTCCTACC 250 

GACAAGCCCC TCCGCCTTCC CCTCCAGGAT GTGTACAAGA TTGGTGGTAT 300

TGGCACGGTT CCCGTCGGTC GTGTCGAGAC CGGTATCATC AAGCCCGGTA 350 

TGGTCGTCAC CTTCGCCCCC GCTGGTGTCA CCACTGAAGT CAAGTCCGTC 400 

GAGATGCACC ACGAGCAGCT TGCCGAGGGT GTCCCCGGTG ACAACGTCGG 450 

CTTCAACGTC AAGAACGTCT CCGTCAAGGA GATCCGTCGT GGTAACGTTG 500 

CCGGTGACTC CAAGAACGAC CCCCCCAAGG GTTGCGAGTC CTTCAACGCC 550

CAGGTCATCG TCCTCAACCA CCCCGGTCAG GTCGGTGCCG GTTACGCACC 600 

AGTCCTTGAC TGCCACACTG CCCACATTGC TTGCAAGTTC TCCGAGCTCC 650 

TCGAGAAGAT TGACCGCCGT ACCGGAAAGT CTGTTGAGAA CTCCCCCAAG 700 

TTCATCAAGT CCGGTGACGC CGCCATCGTC AAGATGGTTC CCTCCAAG 748



2) INFORMATION FOR SEQ ID NO: 791

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 958 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus

(B) STRAIN: ATCC 14110

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 791

CGCTATTGTC GTTGTTGCTG CCTCCGACGG TCAGATGTAG GTGGAACATC 50

TTGGGAAATA CGTCGTAAAA CACGGCGCTT ACGTTTTCGC GAATAGGCCC 100

CAGACTCGTG AGCATTTGCT GCTCGCCCGC CAGGTTGGTG TCCAGAAGAT 150

CGTTGTCTTC GTCAACAAAA TCGATGCTAT TGATGATCCG GAGATGCTGG 200

AACTGGTCGA ACTCGAGATG CGTGAGCTGC TGAACAGCTA CGGTTTCGAG 250

GGTGAAGAGA CTCCGATCAT TTTCGGTTCC GCTCTCTGTG CTCTCGAAGG 300

ACGCCGTGAC GACATCGGTA AAGACAGAAT TGAGCAGCTT ATGAACGCTG 350

TCGACACCTG GATCCCCACT CCTCAGCGTG ACCTCGACAA ACCTTTCTTG 400

ATGTCTGTCG AGGAAGTGTT CTCTATCGCC GGCCGTGGTA CCGTGGCTTC 450

TGGTCGTGTC GAGCGTGGTA TCTTGAAGAA GGACTCTGAG GTTGAGATTG 500

TTGGAGGCTC CTTCGAACCC AAGAAGACCA AAGTCACCGA CATTGAAACC 550

TTCAAGAAGA GCTGTGATGA ATCGCGTGCT GGTGACAACT CTGGTCTCCT 600

CCTGCGTGGT ATCCGACGTG AAGACGTCAA GCGTGGTATG GTCATTGCTG 650

TTCCCGGCAG CACCAAAGCT CACGACAAGT TCCTCGTCTC CATGTACGTC 700

CTGACCGAGG CGGAGGGTGG TCGTCGTACT GGCTTCGGTG CCAACTACCG 750

TCCCCAAGTC TTCATCCGTA CTGCAGGTAA GTTCCCGCAC ACCGTGTCCA 800

GATCTTCCGA GAGATTAGCG ATATATGCTA ATGATTCATC AGACGAGGCT 850

GCTGACCTCA GCTTCCCTGA CGGCGACCAA TCTCGCAGAG TTATGCCTGG 900

TGACAACGTC GAGATGATCC TGAAGACCCA CCACCCTGTT GCTGCTGAGG 950

CTGGTCAA 958



2) INFORMATION FOR SEQ ID NO: 792 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 936 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes

(B) STRAIN: ATCC 8125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 792 

CGTTGTCGCA GCTTCTGACG GTCAAATGTA ATTGAATGCC CGCCCAGACG 50 

GATGAAAGGA TTTGACGTTT CTAACATCAT TCTAGGCCTC AGACCAGAGA 100 

ACATTTGCTC CTTGCCCGCC AGGTCGGTGT CCAGAAGCTG GTCGTTTTCG 150

TTAACAAGGT CGATGCCGTT GAGGACCCAG AGATGTTGGA GCTTGTCGAA 200

CTTGAAATGC GTGAACTCCT CAGCCACTAC GGTTTCGAGG GTGAGGAGAC 250 

CCCCATCATT TTTGGCTCTG CTCTCTGTGC CCTCGAGTCC CGTCGACCTG 300 

AGCTTGGTGT CGAGAAGATT GACGAGCTAT TGAACGCCGT CGACACCTGG 350 

ATCCCCACCC CCGAGCGCGC CACTGATAAG CCTTTCCTCA TGTCCATTGA 400

GGAAGTGTTC TCTATCTCTG GTCGTGGTAC CGTCGTCTCC GGTCGTGTTG 450 

AGCGTGGTAT CCTCAAGAAG GATTCCGACG TCGAAATTGT TGGTGGCTCT 500 

ACCACCCCTA TCAAGACCAA GGTCACAGAT ATCGAAACCT TCAAGAAGTC 550 

CTGCGATGAA TCTCGAGCTG GTGACAACTC TGGTCTCCTT CTCCGAGGTA 600 

TCAAGCGTGA GGACTTGAAG CGTGGAATGG TTGTTGCTGC CCCCGGATCC 650

ACCAAGGCTC ACACCGACTT CATGGTCTCC CTCTACGTCC TGACTGAGGC 700 

TGAGGGTGGT CGTTCCAACG GCTTCACCCA CAAGTACCGC CCCCAAATGT 750 

TCATCCGTAC TGCTGGTATG TAACCCAAGT TTCCGCTATT TACTAAGTAG 800

ATCATTGCTA ACTTGTATTT CCTTCCGTAG ACGAAGCCGC ATCTTTCAGC 850

TGGCCTGGAG AAGACCAAGA CAAGAAGGCT ATGCCTGGTG ACAACGTCGA 900

GATGATTTGC AAGACCCTCC ACCCCATTGC TGCCGA 936 



2) INFORMATION FOR SEQ ID NO: 793

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 793

TTATTGTTGC TGCTGGTACT 20 



2) INFORMATION FOR SEQ ID NO: 794 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 794 

GACGACAAGT CGGTGAACTT 20 



2) INFORMATION FOR SEQ ID NO: 795

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Trypanosoma cruzi 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 795 

ACTTGCACGC GATGTGGCAG 20




2) INFORMATION FOR SEQ ID NO: 796 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 796 

GGTCCAATGC CWCAAACWAG A 21 



2) INFORMATION FOR SEQ ID NO: 797

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 797

CATTAAGAAT GGYTTATCTG TSKCTCT 27 



2) INFORMATION FOR SEQ ID NO: 798 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 798

TGGTTGTCCC AGCCGATCGT TT 22



2) INFORMATION FOR SEQ ID NO: 799

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 799 

ACCTGTGAAT ACAAGCAATC T 21



2) INFORMATION FOR SEQ ID NO: 800



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 800 

GATGAAATCT TCAACGAAGT TGAT 24 




2) INFORMATION FOR SEQ ID NO: 801 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 801 

ACAACACCGA GAAGATCCCA 20 

PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 802 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 802 

TTGCCATTTC TGGTTTCGTT 20 



2) INFORMATION FOR SEQ ID NO: 803 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 803 

ACTTCAGTGG TAACACCAGC 20 



2) INFORMATION FOR SEQ ID NO: 804 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 804 

CCTGGGACGG CCTCTGGCAT 20 


2) INFORMATION FOR SEQ ID NO: 805 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 805 
CTCTTGTCCA TCTTAGCAGT 20 


2) INFORMATION FOR SEQ ID NO: 806 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 806 

AGCATCACCA GACTTGATAA G 21 






2) INFORMATION FOR SEQ ID NO: 807 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 807 

AAAGTGGCTT CAAAGGTTGC 20 



2) INFORMATION FOR SEQ ID NO: 808 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 808 
GCITTAIWRG CATTAGAARA YCCA 



2) INFORMATION FOR SEQ ID NO: 809 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 809 
TCTTCCTGTW GCAACTGTTC CTCT 



2) INFORMATION FOR SEQ ID NO: 810 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 810 
AGAGMWACAG ATAARSCATT CTTA 



2) INFORMATION FOR SEQ ID NO: 811 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 811 
TRAARTAGAA TTGTGGTCTR TATCC 



2) INFORMATION FOR SEQ ID NO: 812 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 812 
GTIACIGGIT CYTYRARRTT ICCICC 



2) INFORMATION FOR SEQ ID NO: 813 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 813 
AATCYGTYGA AATGCAYCAC GA 



2) INFORMATION FOR SEQ ID NO: 814 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 814 
GCIGGCACGT ACACIGCCTG 



2) INFORMATION FOR SEQ ID NO: 815 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 815 
TGGTGCATYT CKACRGACTT 



2) INFORMATION FOR SEQ ID NO: 816 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 816 
GCTACGACGA GATCAAGGGC 



2) INFORMATION FOR SEQ ID NO: 817 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 817 
TGGAAGAAGG CCGAGGAGTT 



2) INFORMATION FOR SEQ ID NO: 818 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 818 
AGCCGGGCTG GATCTTCTTC 



2) INFORMATION FOR SEQ ID NO: 819 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 819 



2) INFORMATION FOR SEQ ID NO: 820 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 820 

GAAGGAGGTG TCTGCTTACA C 



2) INFORMATION FOR SEQ ID NO: 821 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 821 

GGCGCAAACG TCACCACATC A 



2) INFORMATION FOR SEQ ID NO: 822 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 822 

CGGCGGATGT CCTTAACAGA A 



2) INFORMATION FOR SEQ ID NO: 823 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 823 
GAGCGGTATG AYGAGATTGT 



2) INFORMATION FOR SEQ ID NO: 824 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 824 
GGCTTCTGCG GCACCATGCG 



2) INFORMATION FOR SEQ ID NO: 825 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 825 
ATGAGCARCG SAACCATCGT TCAGTG 



2) INFORMATION FOR SEQ ID NO: 826 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 826 
TCGATCGTGC CGACCATGTA GAACGC 



2) INFORMATION FOR SEQ ID NO: 827 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 827 

CACCAACTTG CTAAATGGGG AGATGCCCAG ATTGTTGTAT ATATAGGCTG 50 

TGGAGAACGT GGAAATGAAA TGACAGATGT TCTTAATGAG TTTCCAGAAC 100 

TTAAAGATCC TAAGACTGGC AAATCAATAA TGGAAAGAAC AGTTTTAATA 150 

GCAAATACTT CTAATATGCC AGTTGCAGCC CGTGAAGCTT GTATATATAC 200 

AGGAATCACA ATAGCAGAAT ATTTTAGAGA TATGGGATAT TCAGTAGCAC 250 

TTATGGCGGA TTCCACTTCA CGTTGGGCAG AGGCATTAAG AGAAATGTCT 300 

GGAAGACTTG AAGAAATGCC TGGTGATGAA GGTTACCCAG CTTATTTAGG 350 

ATCAAGACTT GCTGATTTCT ATGAAAGAGC TGGAAAAGTT GTGTGTTTAG 400 

GAGACGATGA AAGAGAAGGT GCCATTACTG CAATAGGTGC TGTATC 446 



2) INFORMATION FOR SEQ ID NO: 828 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 828 

CAGCATCAGC TTGCTAAATG GGCAGATGCA GATATAGTTG TATATATAGG 50 

CTGTGGCGAG CGTGGAAATG AAATGACAGA TGTTCTTCTT GAATTTCCTG 100 

AATTAAAAGA CCCAAGAACA GGCGAGTCAC TTATGCAAAG AACTGTGCTT 150 

ATAGCAAATA CATCAGATAT GCCGGTTGCT GCACGTGAAG CTTCTATATA 200 

CACTGGTATT ACAATAGCTG AATATTTTAG AGATATGGGA TATAGTGTTG 250 

CACTTATGGC AGACTCTACA TCAAGATGGG CTGAGGCTCT TAGAGAGATG 300 

AGTGGTCGTT TAGAGGAGAT GCCTGGTGAA GAAGGTTATC CTGCATACTT 350 

AGGTTCACGT CTTGCTCAAT TCTATGAGAG AGCAGGAAAG GTAAATTGTC 400 

TAGGTATGGA TGAAAGAGAA GGAACACTTA CAGCAATTGG TGCAG 445 



2) INFORMATION FOR SEQ ID NO: 829 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12464 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 829 



ATGCTATAGC TAAATGGGGA GACAGCGAAA TAGTTGTTTA CGTTGGATGT 50 

GGAGAACGTG GTAACGAAAT GACAGACGTT CTTAACGAAT TCCCAGAACT 100 

TATTGACCCA AAAACTGGGG AAAGTTTAAT GAAGAGAACA GTACTTATAG 150 

CTAATACTTC AAACATGCCA GTTGCTGCTA GAGAAGCTTG CATATACACA 200 

GGTATTACAA TAGCTGAATA CTTCAGAGAT ATGGGATACT CAGTATCTAT 250 

AATGGCTGAT TCAACTTCAA GATGGGCAGA AGCATTAAGA GAAATGTCAG 300 

GTAGACTTGA AGAAATGCCA GGTGATGAAG GATATCCAGC GTACTTAGGA 350 

TCAAGACTTG CTGATTATTA CGAAAGAGCA GGTAAGGTTG TTTGTCTAGG 400 

TAAAGATGGT AGAGAAGGTG CTGTAACAGC AATTGGAGCT GTATC 445 



2) INFORMATION FOR SEQ ID NO: 830 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botulinum 

(B) STRAIN: 20:3.1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 830 



TCAAATTGCT AAATGGGGAG ATGCAGAAAT CGTTGTTTAC GTTGGATGCG 50 

GAGAACGTGG TAACGAAATG ACAGACGTTG TTAATGAGTT CCCAGAACTT 100 

ATTGACCCTA AGACTGGCGA AAGCTTAATG AAGAGAACAG TTCTTATAGC 150 

TAATACTTCA AACATGCCAG TTGCAGCGAG AGAAGCTTCA ATATATACAG 200 

GTATCACAAT AGCTGAATAT TTCAGAGATA TGGGATATGC AGTATCAATA 250 

ATGGCTGACT CAACTTCAAG ATGGGCTGAG GCATTAAGAG AAATGTCTGG 300 

TAGACTTGAA GAAATGCCTG GTGATGAAGG ATATCCAGCT TACCTTGGAT 350 

CAAGACTTGC TGATTACTAT GAAAGAGCTG GTAAGGTTGA ATGTTTAGGT 400 

AATGATGGAA GAATTGGTTC TATAACAGCA ATCGGTGCGG TATC 444 



2) INFORMATION FOR SEQ ID NO: 831 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 456 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: ATCC 13124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 831 

TAGTTCAGCA CCAAGTTGCT AAATGGGGAG ATACTGAGAT AGTTGTTTAC 50 

GTTGGATGTG GAGAACGTGG TAACGAGATG ACAGACGTTC TTAACGAATT 100 

CCCAGAACTT AAAGACCCTA AAACTGGGGA AAGCTTAATG AAGAGAACAG 150 

TTCTTATTGC TAATACATCT AACATGCCAG TTGCTGCCAG AGAAGCATCA 200 

ATATATACTG GTATAACAAT AGCAGAGTAT TTCAGAGATA TGGGATACTC 250 

AGTATCAATC ATGGCTGACT CAACTTCACG TTGGGCAGAG GCTTTAAGAG 300 

AAATGTCAGG AAGACTTGAA GAAATGCCAG GAGACGAAGG TTACCCAGCA 350 

TACTTAGGAT CAAGACTTGC TGATTACTAT GAAAGAGCTG GTAAAGTTGT 400 

AGCTTTAGGT AAAGATGGAA GAGAAGGAGC TGTTACAGCT ATCGGAGCAG 450 

TATCCC 456 



2) INFORMATION FOR SEQ ID NO: 832 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tetani 

(B) STRAIN: ATCC 19406 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 832 

CCAACTTGCA AAATGGGCTG ATGCTCAAAT AGTTGTGTAC ATAGGATGTG 50 

GAGAACGTGG AAATGAAATG ACAGACGTTT TAAATGAGTT CCCAGAATTA 100 

AAGGATCCTA AAACCGGGGA ATCTTTAATG AAAAGAACTG TGTTAATAGC 150 

AAATACATCT AATATGCCTG TTGCAGCTAG AGAAGCATCT ATATATACTG 200 

GTATAACAAT AGGGGAATAT TTTAGAGATA TGGGATATTC AATAGCACTA 250 

ATGGCAGATT CGACTTCTAG ATGGGCAGAG GCTCTAAGAG AAATGTCTGG 300 

^AGACTAGAG GAGATGCCAG GTGAAGAAGG TTATCCAGCT TATTTAGGAT 350 

CTAGATTAGC AGAGTTCTAT GAAAGAGCAG GTAATGTTAT ATGTTTAGGT 400 

CAGGATGGAA GAGAAGGAGC ATTAACAGCT ATAGGAGCAG TTTC 444 



2) INFORMATION FOR SEQ ID NO: 833 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1786 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 
(C) ACCESSION NUMBER: Genome project 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 833 

TGAACCAAGG AAAAATAATA ACCGTTTCGG GACCTCTTGT TGTGGCTTCT 50 

GGGATGCAAG AAGCTAATAT TCAAGATATT TGTCGTGTGG GACATCTTGG 100 

CTTAGTCGGA GAAATTATTG AAATGCGTCG CGATCAAGCG TCTATTCAGG 150 

TTTATGAGGA AACATCAGGG ATCGGTCCAG GAGAACCAGT AGTGACTACT 200 

GGTTGTCCTT TGTCGGTCGA GTTAGGCCCG GGCCTGATTT CAGAAATGTT 250 

TGACGGTATT CAGCGACCGC TTGATCGTTT TCAAAAAGCA ACGGACAGCG 300 

ACTTTTTAAT CCGTGGTGTG GCTATCCCAA GTCTTGATCG AAAGGCTAAG 350 

TGGGCATTTA TTCCCAAGCT AAGTGTTGGT CAAGAAGTAG TTGCAGGTGA 400 

TATTTTAGGA ACTGTGCAAG AAACAGCTGT CATTGAGCAC CGTATCATGG 450 

TTCCTTATAA AGTTTCAGGG ACCTTGGTGG CTATTCATGC AGGGGACTTC 500 

ACAGTAACAG ATACAGTTTA TGAAATTAAG CAGGAAGACG GTTCCATTTA 550 

CCAAGGTAGC CTCATGCAGA CTTGGCCAGT TCGTCAAAGT CGCCCTGTTG 600 

CTCAAAAGCT TATCCCAGTC GAACCTTTGG TTACAGGTCA ACGGGTTATT 650 

GACACCTTTT TCCCTGTTAC AAAAGGTGGT GCCGCTGCCG TTCCTGGACC 700 

ATTTGGGGCA GGAAAAACAG TTGTGCAGCA TCAAATAGCT AAATTTGCCA 750 

ACGTTGATAT TGTTATTTAT GTCGGTTGTG GGGAACGCGG CAACGAGATG 800 

ACCGACGTTT TGAATGAGTT TCCAGAGTTA ATTGACCCAA ATACAGGCCA 850 

GTCCATTATG GAGCGCACGG TGTTAATTGC AAACACCTCT AATATGCCAG 900 

TAGCAGCGCG TGAAGCGTCG ATTTACACAG GTATTACCAT TGCCGAATAT 950 

TTCCGTGATA TGGGCTATTC TGTGGCTATC ATGGCAGACT CGACATCACG 1000 

TTGGGCAGAA GCTCTGCGCG AGATGTCAGG ACGCCTACAA GAAATGCCTG 1050 

GTGATGAAGG CTACCCGGCT TACTTAGGGA GTCGTATTGC CGAATATTAT 1100 

GAACGGGCTG GTCGTGTTCG GACCTTGGGA AGTCAAGAAC GTGAGGGAAC 1150 

CATTACAGCC ATCGGCGCGG TTTCTCCTCC TGGAGGGGAT ATTTCAGAGC 1200 

CTGTCACTCA AAACACCCTT CGGATTGTCA AAGTTTTCTG GGGGCTCGAC 1250 

GCGCCTCTTG CGCAACGGCG TCACTTCCCA GCGATTAACT GGCTGACGTC 1300 

TTATTCATTG TATCAAGATG ATGTAGGAAG CTATATTGAC CGTAAACAGC 1350 

AATCTAATTG GTCCAACAAG GTAACTCGTG CCATGGCTAT TTTGCAGCGT 1400 

GAAGCCAGTC TAGAAGAAAT TGTACGCTTG GTGGGGCTTG ATTCACTGTC 1450 

TGAACAAGAT CGTTTGACCA TGGCTGTTGC CCGGCAAATT CGGGAGGATT 1500 

ATCTCCAGCA AAATGCCTTT GATTCGGTGG ATACCTTTAC TTCCTTTCCG 1550 

AAACAAGAGG CCATGCTAAC CAATATTTTG ACCTTTAATG AGGAAGCCAG 1600 

CAAAGCCCTT TCTTTGGGAG CTTATTTTAA TGAGATTATG GAAGGCACTG 1650 

CTCAGGTACG CGATCGCATC GCACGCAGCA AATTTATCCC AGAAGAAAAC 1700 

TTAGAGCAGA TTAAAGGGCT TACTCAGAAG GTTACCAAAG AGATTCACCA 1750 

CGTTTTAGCA AAGGGAGGAA TTTAGATGAG CGTTCT 1786 




2) INFORMATION FOR SEQ ID NO: 834 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 
(B) STRAIN: Suarez-3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 834 



TATCTCACGT AAGTTTTTGC GCGGCGGTTA TATATCACTC CAGGCCCTGG 50 

CTAAGTACGC TAATACTGAC GTTACTGTCT ATGTGGGATG TGGAGAGCGT 100 

GGAAACGAGA TTGCGGAGGT GCTTAAGGAG TTCCCTGAGC TGAAGACCAA 150 

GGTTGATGGC AAGGAAGTGA GCATTATGAA ACGCACTTGC TTGGTGGCCA 200 

ATACTTCAAA CATGCCAGTG GCCGCCAGGG AGGCTAGTAT CTACACTGGC 250 

ATTACCCTAT GTGAATACTT CAGGGATATG GGATACAACG CCTGTGTGAT 300 

GGCGGATTCC ACCAGTCGTT GGGCTGAGGC TTTGCGTGAG ATATCAGGTC 350 

GTTTAGCTGA GATGCCTGCT GATTCAGGTT ATCCCGCCTA CCTTGCTTCT 400 

AGGCTTTCGG CGTTCTATGA GCGTGCTGGT ACAGCTGAGT GTATTGGAAC 450 

ACCACTTCGT GAAGGTTCAG TTACCATTGT TGGTGCTGTA TCTCCACCA 499 



2) INFORMATION FOR SEQ ID NO: 835 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 835 

TTTCTCAAGC TTTGAGTAAA TATAGTAACT CTGATGTTAT TATTTACATT 50 

GGTTGTGGAG AAAGAGGAAA TGAAATGGCA GAAGTTCTTA CAGAATTCCC 100 

TGAGCTTTAT ACTATGGTTG ATGGAAAGAA GGAGTCAATT ATGCAAAGAA 150 

CTTGTTTAGT AGCTAATACA TCAAATATGC CTGTCGCTGC TAGAGAAGCT 200 

TCCATCTACA CTGGTATTAC ACTTTCTGAA TACTTTAGAG ATATGGGATG 250 

TAATGTTTCT ATGATGGCAG ATTCAACTTC TCGTTGGGCT GAAGCTCTTA 300 

GAGAAATTTC TGGTAGATTA GCTGAAATGC CTGCAGATTC GGGTTACCCA 350 

GCATATTTAG GCGCCAGACT TGCTTCATTC TATGAAAGAT CAGGAAGAGT 400 

TAAATGTATG GGTTCCCCAG ATAGAGAAGG TACAGTAACA ATTGTTGGTG 450 

CAGTTTCTCC ACCT 464 



2) INFORMATION FOR SEQ ID NO: 836 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 836 

GCCCTCTCCA AGTACTCCAA CTCCGATTGC GTCATCTATG TCGGCTGCGG 50 

CGAGCGCGGT AATGAGATGG CCGAGGTGCT CATGGAGTTC CCGACCCTGA 100 

CGACCGTGAT CGATGGCCGC GAGGAGTCGA TCATGAAGCG CACCTGCCTC 150 

GTGGCGAACA CCTCGAACAT GCCAGTCGCA GCCCGTGAGG CCTCTATTTA 200 

CACCGGCATC ACCCTGGCCG AGTACTACCG TGATATGGGC AAGCATATCG 250 

CCATGATGGC TGACTCGACG TCTCGCTGGG CCGAGGCGCT TCGTGAGATT 300 

TCGGGTCGTC TGGCGGAGAT GCCGGCGGAT GGTGGCTACC CCGCCTACCT 350 

CAGCGCTCGT CTCGCCTCCT TCTACGAGCG CGCCGGCCTC GTCACCTGCA 400 

TCGGCGGGCC GAAGCGCCAG GGCTCCGTCA CGATCGTCGG TGCCGT 446 



2) INFORMATION FOR SEQ ID NO: 837 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 456 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 






(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 
(B) STRAIN: ATCC 50122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 837 

TTAGTCAGGC CCTCTCCAAG TACTCCAACT CCGACTGCGT CATCTATGTC 50 

GGCTGCGGCG AGCGCGGTAA TGAGATGGCT GAGGTGCTCA TGGATTTCCC 100 

AACTCTGACG ACCGTGATCG ATGGTCGCGA GGAGTCCATC ATGAAGCGCA 150 

CCTGCCTCGT GGCAAACACT TCGAACATGC CAGTCGCAGC CCGCGAGGCC 200 

TCTATTTACA CCGGCATCAC CCTGGCCGAG TACTACCGTG ATATGGGCAA 250 

GCATATTGCC ATGATGGCCG ACTCGACATC TCGCTGGGCC GAGGCGCTTC 300 

GTGAGATTTC CGGTCGTCTG GCGGAGATGC CAGCCGATGG TGGCTACCCT 350 

GCCTACCTCA GCGCTCGTCT CGCCTCCTTC TACGAGCGCG CCGGCCTCGT 400 

CACCTGCATC GGCGGGCCGA AGCGCCAGGG CTCCGTCACG ATCGTCGGTG 450 

CTGTGT 456 



2) INFORMATION FOR SEQ ID NO: 838 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 450 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 

(B) STRAIN: MOU-2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 838 



AGGCCCTCTC CAAGTACTCC AACTCCGACT GCGTCATCTA CGTCGGCTGC 50 

GGCGAGCGCG GTAATGAGAT GGCCGAGGTG CTCATGGAGT TCCCGACCCT 100 

GACGACTGTG ATTGATGGCC GTGAGGAGTC GATCATGAAG CGGACCTGCC 150 

TCGTGGCCAA CACCTCCAAC ATGCCAGTCG CAGCCCGTGA AGCCTCTATT 200 

TACACTGGTA TCACCCTGGC CGAATACTAC CGTGATATGG GCAAGCATAT 250 

CGCCATGATG GCTGACTCGA CGTCTCGCTG GGCGGAGGCG CTTCGTGAGA 300 

TTTCGGGTCG CCTGGCGGAG ATGCCGGCTG ATGGTGGGTA CCCCGCCTAC 350 

CTTAGTGCTC GTCTTGCCTC CTTCTACGAG CGTGCCGGCC TCGTCACCTG 400 

CATCGGTGGG CCGAAGCGCC AGGGCTCCGT CACGATCGTC GGTGCCGTGT 450 



2) INFORMATION FOR SEQ ID NO: 839 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 bases 

(B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(B) STRAIN: EATRO 795 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 839 



GCACTCTCGA AGTACTCGAA CAGTGACGCT GTTATTTACG TGGGTTGTGG 50 

TGAGCGTGGC AATGAGATGG CGGAGGTGCT CATGGACTTT CCCACCCTCA 100 

CCACCATTAT TGATGGACGT GAGGAGTCTA TCATGAAGCG CACATGCCTG 150 

GTGGCAAATA CTTCCAATAT GCCTGTTGCT GCTCGTGAGG CATCTATTTA 200 

CACTGGTATC ACCTTAGCTG AGTATTATCG GGATATGGGA AAACACATCG 250 

CCATGATGGC CGATTCAACC TCCCGCTGGG CTGAGGCTTT GCGTGAGATT 300 

TCCGGTCGTC TTGCTGAAAT GCCTGCAGAT GGAGGTTATC CCGCGTACCT 350 

CAGCGCCCGT TTGGCCTCCT TCTACGAGCG TGCTGGCCGT GTGACATGCA 400 

TCGGTGGGCC GAAGCGTGAA GGGTCTGTAA CAATTGT 437 




2) INFORMATION FOR SEQ ID NO: 840 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 
(B) STRAIN: MM3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 840 

TGAAGGCTGA GCGCGAGCGC GGCATCACGA TCGACATCGC GCTCTGGAAG 50 

TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGACGCCC CCGGCCACCG 100 

CGACTTCATC AAGAACATGA TCACGGGCAC GTCTCAGGCG GACGCCGCCG 150 

TCCTTGTCAT TGCGTCATCG CAGGGTGAGT TTGAGGCGGG CATCTCGAAG 200 

GACGGCCAGA CACGCGAGCA CGCGCTGCTC GCCTTCACGC TCGGCGTGAA 250 

GCAGATGGTT GTGTGCTGCA ACAAGATGGA CGACAAGTCG GTGAACTTCG 300 

CCCAGGAGCG CTACGATGAG ATTGTGAAGG AGGTGTCGGC GTACCTGAAG 350 

AAGGTTGGGT ACAACGTGGA GAAGGTGCGC TTCATCCCCA TCTCCGGCTG 400 

GCAGGGCGAC AACATGATTG ACAAGTCGGA AAATATGCCG TGGTACAAGG 450 

GCCCCACGCT GCTGGAGGCA CTCGACATGC TGGAGCCCCC GGTGCGCCCC 500 

AGCGACAAGC CGCTGCGCCT GCCGCTGCAG GACGTGTACA AGATCGGCGG 550 

TATCGGCACC GTGCCGGTCG GTCGCGTGGA GACGGGCACG ATGAAGCCCG 600 

GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA GGTGAAGTCG 650 

ATTGAGATGC ACCACGAGCA GCTGGCCGAG GCCACGCCCG GCGACAACGT 700 

CGGCTTCAAC GTGAAGAACG TGTCCGTGAA GGACATCCGC CGTGGCAACG 750 

TGTGCGGCAA CTCGAAGAAC GACCCCCCAA AGGAGGCGGC CGACTTCACG 800 

GCGCAGGTGA TCATCCTGAA CCACCCCGGC CAGATCGGCA ACGGCTATGC 850 

GCCGGTGCTC GACTGCCACA CCTGCCACAT CGCGTGCAAG TTCGCCGAGA 900 

TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTTGA GAAGAACCCC 950 

AAGTCGATCA AGTCCGGTGA CGCCGCCATG GTGCGCATGG TGCCGCAGAA 1000 

GCCCATGTGC GTGGAGGTGT TCAACGACTA CGCTCCTCTT GGCCGCTTTG 1050 

CC 1052 






2) INFORMATION FOR SEQ ID NO: 841 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1061 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: CGL-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 841 



TGAAGGCTGA GCGCGAGCGC GGCATCACGA TCGACATCGC GCTCTGGAAG 50 

TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGACGCCC CCGGCCACCG 100 

CGACTTCATC AAGAACATGA TCACGGGCAC GTCTCAGGCG GATGCCGCCG 150 

TCCTTGTCAT TGCGTCATCG CAGGGTGAGT TTGAGGCGGG CATCTCGAAG 200 

GACGGCCAGA CACGCGAGCA CGCGCTGCTC GCCTTCACGC TCGGCGTGAA 250 

GCAGATGGTT GTGTGCTGCA ACAAGATGGA CGACAAGTCG GTGAACTTTG 300 

CCCAGGAGCG CTACGATGAG ATTGTGAAGG AGGTGTCGGC GTACCTGAAG 350 

AAGGTTGGGT ACAACGTGGA GAAGGTGCGC TTCATCCCCA TCTCCGGCTG 400 

GCAGGGCGAC AACATGATTG ACAAGTCGGA AAATATGCCG TGGTACAAGG 450 

GCCCCACGCT GCTGGAGGCA CTCGACATGC TGGAGCCCCC GGTGCGCCCC 500 

AGCGACAAGC CGCTGCGCCT GCCGCTGCAG GATGTGTACA AGATCGGCGG 550 

TATCGGCACC GTGCCGGTCG GTCGCGTGGA GACGGGCACG ATGAAGCCCG 600 

GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA GGTGAAGTCG 650 

ATTGAGATGC ACCACGAGCA GCTGGCCGAG GCCACGCCCG GCGACAACGT 700 

CGGCTTCAAC GTGAAGAACG TGTCCGTGAA GGACATCCGC CGTGGCAACG 750 

TGTGCGGCAA CTCGAAGAAC GATCCCCCAA AGGAGGCGGC CGACTTCACG 800 

GCGCAGGTGA TCATCCTGAA CCACCCCGGC CAGATCGGCA ACGGCTATGC 850 

GCCGGTGCTT GACTGCCACA CCTGCCACAT CGCGTGCAAG TTCGCCGAGA 900 

TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTTGA GAAGAACCCC 950 

AAGTCGATCA AGTCCGGTGA CGCCGCCATG GTGCGCATGG TGCCGCAGAA 1000 

GCCCATGTGC GTGGAGGTGT TCAACGACTA CGCTCCTCTT GGCCGCTTTG 1050 

CCGTGCGTGA C 1061 



2) INFORMATION FOR SEQ ID NO: 842 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: PCU-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 842 

GCTGAAGGCT GAGCGCGAGC GCGGCATCAC GATCGACATC GCGCTCTGGA 50 

AGTTCGAGTC GCCCAAGTCT GTGTTCACGA TCATCGACGC CCCCGGCCAC 100 

CGCGACTTCA TCAAGAACAT GATCACGGGC ACGTCTCAGG CGGACGCCGC 150 

CGTCCTTGTC ATTGCGTCAT CGCAGGGTGA GTTTGAGGCG GGCATCTCGA 200 

AGGACGGCCA GACACGCGAG CACGCGCTGC TCGCCTTCAC GCTCGGCGTG 250 

AAGCAGATGG TTGTGTGCTG CAACAAGATG GACGACAAGT CGGTGAACTT 300 

CGCCCAGGAG CGCTACGATG AGATTGTGAA GGAGGTGTCG GCGTACCTGA 350 

AGAAGGTTGG GTACAACGTG GAGAAGGTGC GCTTCATCCC CATCTCCGGC 400 

TGGCAGGGCG ACAACATGAT TGACAAGTCG GAAAATATGC CGTGGTACAA 450 

GGGCCCCACG CTGCTGGAGG CACTCGACAT GCTGGAGCCC CCGGTGCGCC 500 

CCAGCGACAA GCCGCTGCGC CTGCCGCTGC AGGACGTGTA CAAGATCGGC 550 

GGTATCGGCA CCGTGCCGGT CGGTCGCGTG GAGACGGGCA CGATGAAGCC 600 

CGGCGACGTG GTGACGTTTG CGCCCGCCAA CGTGACGACG GAGGTGAAGT 650 

CGATTGAGAT GCACCACGAG CAGCTGGCCG AGGCCACGCC CGGCGACAAC 700 

GTCGGCTTCA ACGTGAAGAA CGTGTCCGTG AAGGACATCC GCCGTGGCAA 750 

CGTGTGCGGC AACTCGAAGA ACGACCCCCC AAAGGAGGCG GCCGACTTCA 800 

CGGCGCAGGT GATCATCCTG AACCACCCCG GCCAGATCGG CAACGGCTAT 850 

GCGCCGGTGC TCGACTGCCA CACCTGCCAC ATCGCGTGCA AGTTCGCCGA 900 

GATCGAGTCC AAGATCGACC GCCGCTCCGG CAAGGAGCTT GAGAAGAACC 950 

CCAAGTCGAT CAAGTCCGGT GACGCCGCCA TGGTGCGCAT GGTGCCGCAG 1000 

AAGCCCATGT GCGTGGAGGT GTTCAACGAC TACGCTCCTC TTGGCCGCTT 1050 

TGCCGTGCGT GA 1062 



2) INFORMATION FOR SEQ ID NO: 843 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1057 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 843 

TACATTGACA GCTGCATTGA CCAAGGTCTG TTCGATGGGT GGTCATGGCG 50 

AGTACACTCC TTATGAAGCA ATTGACCGTG CTCCTGAGGA GCGTAAACGT 100 

GGTATTACTA TAAATTCGAC ACATGTTGAA TATGAGACTA AGAACCGTCA 150 

TTACGGTCAC GTGGACTGTC CAGGTCACTC TGATTATGTG AAGAACATGA 200 

TATCTGGCGC TGCTCAGATG GATGGTGCCA TATTGGTTGT TTCTTGTGTT 250 

GACGGTCCCA TGCCTCAGAC TAAGGAGCAC GTGTTGCTTG CTAAGCAGAT 300 

TGGTGTACCT CGTTTAGTTG TGTTTTTGAA CAAGCTTGAC ATGTTAGAGG 350 

ACTCTGAGCT ATTGGAGTTG GTGGAGTTAG AGGTTCGTGA GTTATTGAGT 400 

GAGTTTGGTT ACGACGGTGA CAACACGCCT ATCGTTCGTG GCAGTGCTAT 450 

AAAGGCATTG AACAGTAGTT CCGAGGCTGA CATTAAGCCA ATTCAGGATT 500 

TATTGGATGC GTGTGATGCC TTTTTACTGA CTCCAGAACG TAAGGATGAC 550 

ATGCCGCTCT TGGTTGCTAT TGACGATGTT CTTGCCATTC CTGGCAAGGG 600 

TACTGTTGTA ACCGGTAGGA TAGAGCAGGG CAAGATTCGT TGTGGTGACC 650 

CTATTGAGGT TTGCGCCGGT CCGAAGTCCG GCAAGAAGAC TGTGTGTGTT 700 

GGTCTTRARA TGTTCCGCAA GAGTCTCAGT RAGGGTATTG CTGGTGACCA 750 

GATTGGTGTT TTGCTCAAGG GTGTGAAGCG CGACRAGGTA GAGCGCGGGT 800 

TTGTATTGAT TCAACCCGGA AGTTACAAAT GTCACGGTGA ATTTGATGCT 850 

GACTTGTACG TGTTGACTAC GRAGGAAGGT GGGCGCAAGC ATCCGTTTGT 900 

GTCTAACTAC CGTCCTCAGG CGTTTATACG TACTGGAGAC GTTTGCTGCT 950 

CAGTTCATTT GGATRAGGGT GTTGAGATGG CAGCTCCTGG TGACAACGTG 1000 

CGTTGCAAGA TCAAGTTACT TTATCCCATG CCTGTCCATG AAGGTTTACG 1050 

ATTTGCG 1057 



2) INFORMATION FOR SEQ ID NO: 844 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 943 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 844 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCACCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

TTCATCAACA AAGTGGATAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGCGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAGGAGA 200 

CACCAATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AGTGGATCCC 300 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATCGAGCACG 350 

TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 450 

CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 500 

CGCTGAGCGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CAACCAGTCT GTCCAAGGAC AACGTGGAAC GCGGCATGGT 600 

AATGGCGGCG ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACCGG CTTCAGCCCC 700 

CACTACCGCC CGCAGCTCTT CTTCCATTGC GCTGACGTGA CAGCGGACAT 750 

GAGCTTCCCG GAGGCGGAGA AGCACCGGGA GGAGCTGAAC AAGAAATTCG 800 

GCCGCGGCCC CGAGGAGGAC AAGAAGAAAG AGGCGGAGAT GAAGGAGTTC 850 

GAGAGCAAGC TCGTCTGCAT GCCGGGCGAT AACCGCGAGC TGATCCTGAC 900 

GCTCGCGTAC CCGATGCCCA TTGAAAAGGG CCTGAAGTTC ACC 943 



2) INFORMATION FOR SEQ ID NO: 845 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 

(B) STRAIN: ATCC 50131 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 845 

CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCATCT CTTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGTGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAAGAGA 200 

CGCCCATCGT GCGCGGCTCG GCCCTCAAAG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AATGGATCCC 300 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATTGAGCACG 350 

TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 450 

CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 500 

CGCTGAGTGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CGATCAGTCT CTCCAAGGAC AACGTTGAAC GCGGCATGGT 600 

AATGGCGGCA ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACTGG CTTCAGCCCC 700 

CACTACCGCC CGCAGCTCTT CTTCCATTGT GCTGACGTGA CGGCGGACAT 750 

GAGCTTCCCG GAGGCGGAGA AGCACCGCGA GGAGCTCAAC AAGAAATTCG 800 

GCCGCGGCCC CGAGGAGGAC AAGCAGAAGG AGGCGGAGAT GAAAGAGTTC 850 

GAGAGCAAGC TCGTCTGCAT GCCGGGCGAC AACCGCGAGC TGATCCTGAC 900 

GCTGGCGTAC CCGATGCCCA TTGACAAGGG TCTGAAGTT 939 



2) INFORMATION FOR SEQ ID NO: 846 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 846 

CATTGTGGTG GCGGCCACCG ACGGCGTCAT GCCGCAGACA CGCGAGCACC 50 

TCCTGATCTG CTCGCAGATC GGGCTTCCGG CGCTCGTAGG GTTCATCAAC 100 

AAGGTGGACA TGACGGACGA GGACACGTGC GACCTGGTGG ACATGGAGCT 150 

GCGCGAGCAG CTGGAGAAAT ACAAGTTTCC GGCGGAGGAG ACGCCAATCG 200 

TGCGCGGCTC AGCCCTCAAA GCCGTCGAGG GCGATGCGAA GTACGAGGAG 250 

AACATTCTCG AACTGGTGCG GAAGTGTGAC GAGTGGATCC CTGACCCGCC 300 

GCGCAACACA GACAAGCCTT TCCTTATGGC CATCGAGCAC GTTTACGAGA 350 

TCGGCAAGGA CAAGAAGAGC GTTGTCGTGA CCGGCCGCGT CGATckGGGC 400 

ATTCTGAAGC TCAACACAGA CGCCGAGCTG GCCGGCTTCA GCTCCAAGAA 450 

GTCGACGGTG AGGGTGACGG GCATCGAGAT GTACCACAAG ACGCTGAGCG 500 

AGTGCATGCC TGGTGACTCC GTCGGCGTCA GCATTGTCGG GACCGGCGAC 550 

ACGACCAGTC TATCCAAGGG CAACGTGGAA CGCGGCATGG TGATGGCGGC 600 

GACGGGTAGC ACGAACCTGT ACAACAAGGT GAAGGCGCAG GTGTACGTGC 650 

TGACGAAGGA TGAGGGCGGC CGCCACACTG GCTTTAGTCC TCACTACCGC 700 

CCGCAGCTCT TCTTCCATTG TGCTGACGTG ACGGCGGACA TGAGCTTCCC 750 

GGAGGCGGAG AAGCACCGCG AAGAGCTCAA CAAGAAATTC GGCCGCGGCC 800 

CCGAGGAGGA CAAGAAGAAA GAGGCAGCGA TGAAGGAGTT CGAGAGCAAG 850 

CTCGTCTGCA TGCCGGGCGA TAACCGCGAG CTGATCCTGA CGCTGGCGTA 900 

CCCGATGCCC ATTGAAAAGG GTCTGAAGTT CACCATCCGT GAGGG 945 



2) INFORMATION FOR SEQ ID NO: 847 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 847 

GGCATCATTG TGGTGGCGGC CACCGACGGC GTCATGCCGC AGACACGCGA 50 

GCACCTCCTG ATCTGCTCGC AGATCGGGCT TCCGGCGCTC GTAGGGTTCA 100 

TCAACAAGGT GGACATGACG GACGAGGACA CGTGCGACCT GGTGGACATG 150 

GAGCTGCGCG AGCAGCTGGA GAAATACAAG TTTCCGGCGG AGGAGACGCC 200 

AATCGTGCGC GGCTCAGCCC TCAAAGCCGT CGAGGGCGAT GCGAAGTACG 250 

AGGAGAACAT TCTCGAACTG GTGCGGAAGT GTGACGAGTG GATCCCTGAC 300 

CCGCCGCGCA ACACAGACAA GCCTTTCCTT ATGGCCATCG AGCACGTTTA 350 

CGAGATCGGC AAGGACAAGA AGAGCGTTGT CGTGACCGGC CGCGTCGATC 400 

AGGGCGTTCT GAAGCTCAAC ACAGACGCCG AGCTGGCCGG CTTCAGCTCC 450 

AAGAAGTCGA CGGTGAGGGT GACGGGCATC GAGATGTACC ACAAGACGCT 500 

GAGCGAGTGC ATGCCTGGTG ACTCCGTCGG CGTCAGCATT GTCGGCACCG 550 

GCGACACGAC CAGTCTATCC AAGGGCAACG TGGAACGCGG CATGGTGATG 600 

GCGGCGACGG GTAGCACGAA CCTGTACAAC AAGGTGAAGG CGCAGGTGTA 650 

CGTGCTGACG AAGGATGAGG GCGGCCGCCA CACTGGCTTT AGTCCTCACT 700 

ACCGCCCGCA GCTCTTCTTC CATTGTGCTG ACGTGACGGC GGACATGAGC 750 

TTCCCGGAGG CGGAGAAGCA CCGCGAAGAG CTCAACAAGA AATTCGGCCG 800 

CGGCCCCGAG GAGGACAAGA AGAAAGAGGC AGCGATGAAG GAGTTCGAGA 850 

GCAAGCTCGT CTGCATGCCG GGCGATAACC GCGAGCTGAT CCTGACGCTG 900 

GCGTACCCGA TGCCCATTGA AAAGGGTCTG AAGTTCACC 939 



2) INFORMATION FOR SEQ ID NO: 848 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 933 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania enriettii 

(B) STRAIN: ATCC 50120 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 848 



CACCGACGGC GTCATGCCGC AGACACGGGA GCACCTGCTC ATCTGCTCGC 50 

AGATCGGGCT GCCGGCGCTT GTAGGGTTCA TCAATAAAGT TGACATGACG 100 

GACGAGGATA CGTGCGACCT CGTGGACATG GAGGTGCGGG AACAGCTGGA 150 

GAAGTACAAG TTTCCGGCCG AGGAGACGCC CATCGTACGT GGCTCGGCCC 200 

TCAAGGCCCT CGAGGGGGAT GCGCAATACG AGGGGAGTAT TCTCGAGCTG 250 

GTGCGAAAGT GCGACGAGTG GATCCCCGAC CCGCCGCGCA ACACCGAAAA 300 

GCCTTTCCTC ATGGCTATCG AGCACGTTTA CGAGCTCGGC AAAGACAAGA 350 

AGAGCGTCAT CGTTACCGGC CGCGTCGATC AAGGTGTGCT GAAGCTCAAC 400 

ACAGACGCCG AGCTGGCCGG CTTCAGCGCC AAGAAGGCGA CAGTCAAAGT 450 

GACGGGCATC GAGATGTATC ACAAGACACT CAATGAGTGC ATGCCCGGCG 500 

ACTCTGTCGG TGTCAGCATC GTCGGTACCG GTGACACGAC CAGCTTATCC 550 

AAGGATAATG TTGAGCGCGG TATGGTAATG GCGGCAACGG GTAGCACGAA 600 

CCTGTACAAC AAGCTGAAGG CGCAGGTTTA CGTGCTGACA AAGGAGGAGG 650 

GTGGCCGCCA CACCGGGTTC AGCCCCCACT ACCGCCCGCA GCTCTTCTTC 700 

CACTGCGCTG ACGTGACCGC AGACATGAGC TTCCCGGAGG CGGAGAAGTA 750 

CCGCGAGGAG CTCAACAAGA AGTTCGGCCG TGGCCCTGAG GAGGACAAGA 800 

AGAAAGAGGC GGAGATGAAG GAGTTCGAAA GCAAACTTGT CTGCATGCCA 850 

GGCGATAACC GCGAGCTGAT CCTAACTCTG GCGTACCCGA TGCCCATCGA 900 

CAAGGGCCTG AAGTTCACCA TCCGTGAGGG CGG 933 



2) INFORMATION FOR SEQ ID NO: 849 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 943 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 

(B) STRAIN: ATCC 50121 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 849 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCACCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGCGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAGGAGA 200 

CACCAATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AGTGGATCCC 300 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATCGAGCACG 350 

TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACGGAC GCCGAGCTGG CCGGCTTCAG 450 

CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATTGAGATG TACCACAAGA 500 

CGCTGAGCGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CGACCAGTCT GTCCAAGGAC AACGTGGAAC GCGGCATGGT 600 

AATGGCGGCG ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACTGG CTTCAGCCCC 700 

CACTACCGCC CGCAGCTCTT CTTCCATTGC GCTGACGTGA CAGCGGACAT 750 

GAGCTTCCCG GAGGCGGAGA AGCACCGCGA GGAGCTCAAC AAGAAATTCG 800 

GCCGCGGCCC CGAGGAGGAC AAGAAGAAAG AGGCGGAGAT GAAGGAGTTC 850 

GAGAGCAAGC TCGTCTGCAT GCCGGGCGAT AACCGCGAGC TGATCCTGAC 900 

GCTGGCGTAC CCGATGCCCA TTGAAAAGGG TCTGAAGTTC ACC 943 



2) INFORMATION FOR SEQ ID NO: 850 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 850 

GGCATCATTG TGGTGGCGGC CACCGACGGC GTCATGCCGC AGACACGCGA 50 

ACACCTCCTG ATCTGCTCGC AAATTGGCCT TCCGGCGCTC GTAGGGTTCA 100 

TCAACAAAGT GGACATGACG GACGAGGACA CGTGTGACCT GGTGGACATG 150 

GAGGTGCGCG AGCAGCTGGA GAAATACAAG TTTCCGGCGG AGGAGACACC 200 

AATCGTGCGC GGCTCGGCCC TCAAGGCCGT CGAGGGCGAC GCGAAGTACG 250 

AGGAGAACAT CCTCGAACTG GTGCGGAAGT GCGACGAGTG GATCCCCGAC 300 

CCGCCGCGCA ACACAGACAA GCCTTTCCTT ATGGCCATCG AGCACGTTTA 350 

CGAGATCGGC AAGGACAAGA AGAGCGTCAT CGTGACCGGC CGCGTCGATC 400 

AGGGCGTGCT GAAGCTCAAC ACAGACGCCG AGCTGGCCGG CTTCAGCGCC 450 

AAGAAGTCGA CGGTGAGGGT GACGGGCATT GAAATGTACC ACAAGACGCT 500 

GAGCGAGTGC ATGCCCGGTG ACTCCGTCGG CGTCAGCATT GTCGGCACCG 550 

GCGACACGAC CAGTCTGTCC AAGGACAACG TGGAGCGCGG CATGGTAATG 600 

GCGGCGACGG GTAGCACGAA CCTGTACAAC AAGGTGAAGG CGCAGGTGTA 650 

CGTGCTGACG AAGGATGAGG GCGGCCGCCA CACTGGCTTC AGCCCCCACT 700 

ACCGCCCGCA GCTCTTCTTC CATTGCGCTG ACGTGACAGC GGACATGAGC 750 

TTCCCGGAGG CGGAGAAGCA CCGCGAGGAG CTCAACAAGA AATTCGGCCG 800 

CGGCCCCGAG GAGGACAAGA AGAAAGAGGC GGAGATGAAG GAGTTCGAGA 850 

GCAAGCTCGT CTGCATGCCG GGCGATAACC GCGAGCTGAT CCTGACGCTG 900 

GCGTACCCGA TGCCCATT 918 



2) INFORMATION FOR SEQ ID NO: 851 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 

(B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 851 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCATCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGTGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAAGAGA 200 

CGCCCATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AATGGATCCC 300 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATTGAGCACG 350 

TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 450 

CGTCAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 500 

CGCTGAGTGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CGATCAGTCT CTCCAAGGAC AACGTTGAAC GCGGCATGGT 600 

AATGGCGGCA ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACTGG CTTCAGCCCC 700 

CACTACCGCC CGCAGCTCTT CTTCCATTGT GCTGACGTGA CGGCGGACAT 750 

GAGCTTCCCG GAGGCGGAGA AGCACCGCGA GGAGCTCAAC AAGAAATTCG 800 

GCCGCGGCCC CGAGGAGGAC AAGCAGAAGG AGGCGGAGAT GAAAGAGTTC 850 

GAGAGCAAGC TCGTCTGCAT GCCGGGCGAC AACCGCGAGC TGATCCTGAC 900 

GCTGGCGTAC CCGATGCCCA TTGAGAAGGG TCTGAAGTT 939 

2) INFORMATION FOR SEQ ID NO: 852 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 912 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 

(B) STRAIN: MOU-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 852 

TCATTGTGGT GGCCGCCACC GACGGCGTCA TGCCGCAAAC ACGGGAGCAC 50 

CTTTTGATCT GCTCGCAGAT CGGGCTGCCG GCGCTCGTAG GGTTCATCAA 100 

CAAAGTGGAC ATGACAGACG AAGACACGTG CGACCTGGTA GACCTGGAGG 150 

TGCGTGAGCA GCTGGAGAAG TACAAGTTTC CGGCAGAGGA AACACCAATC 200 

GTGCGTGGCT CGGCCCTCAA GGCCGTTGAG GGCGATGCAA AGTACGAGGA 250 

GAACATCCTC GAACTGGTGC GGAAGTGCGA CGAGTGGATC CCAGACCCGC 300 

CACGCAATAC GGACAAGCCT TTCCTTATGG CCATTGAACA CGTGTACGAG 350 

ATCGGCAAGG ATAGGAAAAG CGTCATCGTA ACCGGCCGCG TCGATCAAGG 400 

TGTGCTGAAG CTGAACACAG ACGCCGAGCT GGCCGGCTTC AGCGCCAAGA 450 

AGTCGACGGT GAAAGTGACG GGCATTGAGA TGTACCACAA GACACTGACA 500 

GAGTGCATGC CCGGCGACTC TGTCGGCGTC AGCATTGTGG GCACTGGYGA 550 

CACGACCAGC CTCTCTAAGG ACAATGTTGA GCGTGGCATG GTACTGGCCG 600 

CTACGGGTAG CACGAACCTG TACAACAAAG TAAAGGCGCA GGTGTATGTA 650 

CTCACGAAGG ATGAGGGCGG CCGCCACACC GGCTTCAGCC CCCACTACCG 700 

TCCGCAGCTC TTCTTCCACT GCGCTGACGT AACGGCGGAC ATGAGCTTCC 750 

CGGAGGCGGA GAAGCACCGC GAGGAACTCA ATAAGAAATT CGGCCGCGGC 800 

CCCGAGGAGG ACAAGAAAAA GGAGGCGGAG ATGAAGGAGT TCGAGAGCAA 850 

GCTGGTCTGC ATGCCAGGCG ATAACCGCGA GCTGATCCTG ACATTGGCGT 900 

ACCCGATGCC TA 912 



2) INFORMATION FOR SEQ ID NO: 853 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 936 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: MM3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 853 



ATTCTTGTGG TGGCAGCTAA CGACGGATGC ATGCCGCAGA CGCGTGAGCA 50 

CCTGCTTATT TGTTCGCAGA TTGGCCTTCC TGCTCTTGTA TGCTTTATCA 100 

ATAAGTGTGA CATGATGCAA GGGCAGGAGG AAATGATTGA ACTTGTTGAA 150 

ATGGAGGTAC GTGAACTTTT GGAGAAGTAC AAGTTCCCTG CGGAGGAGAC 200 

GCCATTTGTG CGGGGGTCTG CGGTGAAGGC ATTGGAGGGT GATGCTGAAA 250 

ATGAAGGAAA GATTTTGGAG CTTGTAAAAA AATGTGATGA ATGGATTCCC 300 

GACCCACCGC GTGCCATTGA AAAACCGTTC CTTATGGCCA TTGAGCACGT 350 

TTTTGAGGTT GGAAAGGATA AGAAGGCCGT TGTTGTGAGC GGGCGTGTGG 400 

ACCAGGGGCA GTTGAAGGTC GGCGCAGATG CAGAACTTTC CGGGTTTAGC 450 

GCAAAGAAGC TGACGGTGAA GGTTGCTAGC ATCGAAATGT ACCATAAAAT 500 

TCTGGAGGAT TGCATGCCTG GTGACTCTGT TGGCGCGAAG ATCGTTGGCA 550 

GCGGTGAAAC AGTGAACCTG TCGAAGGAAA ATGTGGAACG CGGCATGGTA 600 

CTCTCCGCAC CAGGTGCAAC GACACTGTTC AACAAGGTCC GCGCGCAGGT 650 

GTACGTGTTG ACAAAGGAAG AAGGCGGTCG TCACACAGCC TTTAGTCCTC 700 

ACTATCGTCC GCAGCTTTTC TTCCACTGTG CTGATGTCAC GGCAGATATT 750 

AACTTCCCGG AAAGCGAGAA GCTTGCAGGG GAGCTGAACA AAAAGTATGG 800 

CCGTGATGCG GCGGAACAGA AGAAGAAGGA GGCAGAACTG AAAGAGTTTG 850 

AAAAGACGCT TGTCTGCATG CCTGGTGATA ACCGCGAACT CCTGCTCACC 900 

CTTGCCTATC CAATGCCAAT GGAAAAGGGA CTCAAG 936 



2) INFORMATION FOR SEQ ID NO: 854 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 934 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: PCU-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 854 

CGGCATTCTT GTGGTGGCAG CTAACGACGG ATGCATGCCG CAGACGCGTG 50 

AGCACCTGCT TATTTGTTCG CAGATTGGCC TTCCTGCTCT TGTATGCTTT 100 

ATCAATAAGT GTGACATGAT GCAAGGGCAG GAGGAAATGA TTGAGCTTGT 150 

TGAAATGGAG GTACGTGAAC TTTTGGAGAA GTACAAGTTC CCTGCGGAGG 200 

AGACGCCATT TGTGCGGGGG TCTGCGGTGA AGGCATTGGA GGGTGATGCT 250 

GAAAATGAAG GAAAGATTTT GGAGCTTGTA AAAAAATGTG ATGAATGGAT 300 

TCCCGACCCA CCGCGTGCCA TTGAAAAACC GTTCCTTATG GCCATTGAGC 350 

ACGTTTTTGA GGTTGGAAAG GATAAGAAGG CCGTTGTTGT GAGCGGGCGT 400 

GTGGACCAGG GGCAGTTGAA GGTCGGCGCA GATGCAGAAC TTTCCGGGTT 450 

TAGTGCAAAG AAGCTGACGG TGAAGGTTGC TAGCATCGAA ATGTACCATA 500 

AAATTCTGGA GGATTGCATG CCTGGTGACT CTATTGGCGC GAAGATCGTT 550 

GGCAGCGGTG AAACAGTGAA CCTGTCGAAG GAAAATGTGG AACGCGGCAT 600 

GGTACTCTCC GCACCAGGTG CAACGACACT GTTCAACAGG GTCCGCGCGC 650 

AGGTGTACGT GTTGACAAAG GAAGAAGGCG GTCGTCACAC AGCCTTTAGT 700 

CCTCACTATC GTCCGCAGCT TTTCTTCCAC TGTGCTGATG TCACGGCGGA 750 

TATTAACTTC CCGGAAAGCG AGAAGCTTGC AGGGGAGCTG AACAAAAAGT 800 

ATGGCCGTGA CGCGGCGGAA CAGAAGAAGA AGGAGGCAGA ACTGAAAGAG 850 

TTTGAAAAGA CGCTTGTCTG CATGCCTGGT GATAACCGCG AACTCCTGCT 900 

CACCCTTGCC TATCCAATGC CAATGGAAAA GGGA 934 



2) INFORMATION FOR SEQ ID NO: 855 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 937 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: CGL-1 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 855 



TGGCGGCATT CTTGTGGTGG CAGCTAACGA CGGATGCATG CCGCAGACGC 50 

GTGAGCACCT GCTTATTTGT TCGCAGATTG GCCTTCCTGC TCTTGTATGC 100 

TTTATCAATA AGTGTGACAT GATGCAAGGG CAGGAGGAAA TGATTGAACT 150 

TGTTGAAATG GAGGTACGTG AACTTTTGGA GAAGTACAAG TTCCCTGCGG 200 

AGGAGACGCC ATTTGTGCGG GGGTCTGCGG TGAAGGCATT GGAGGGTGAT 250 

GCTGAAAATG AAGGAAAGAT TTTGGAGCTT GTAAAAAAAT GTGATGAATG 300 

GATTCCCGAC CCACCGCGTG CCATTGAAAA ACCGTTCCTT ATGGCCATTG 350 

AGCACGTTTT TGAGGTTGGA AAGGATAAGA AGGCCGTTGT TGTGAGCGGG 400 

CGTGTGGACC AGGGGCAGTT GAAGGTCGGC GCAGATGCAG AACTTTCCGG 450 

GTTTAGCGCA AAGAAGCTGA CGGTGAAGGT TGCTAGCATC GAAATGTACC 500 

ATAAAATTCT GGAGGATTGC ATGCCTGGTG ACTCTGTTGG CGCGAAGATC 550 

GTTGGCAGCG GTGAAACAGT GAACCTGTCG AAGGAAAATG TGGAACGCGG 600 

CATGGTACTC TCCGCACCAG GTGCAACGAC ACTGTTCAAC AAGGTCCGCG 650 

CGCAGGTGTA CGTGTTGACA AAGGAAGAAG GCGGTCGTCA CACAGCCTTT 700 

AGTCCTCACT ATCGTCCGCA GCTTTTCTTC CACTGTGCTG ATGTCACGGC 750 

AGATATTAAC TTCCCGGAAA GCGAGAAGCT TGCAGGGGAG CTGAACAAAA 800 

AGTATGGCCG TGATGCGGCG GAACAGAAGA AGAAGGAGGC AGAACTGAAA 850 

GAGTTTGAAA AGACGCTTGT CTGCATGCCT GGTGATAACC GCGAACTCCT 900 

GCTCACCCTT GCCTATCCAA TGCCAATGGA AAAGGGA 937 

2) INFORMATION FOR SEQ ID NO: 856 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bigemina 

(B) STRAIN: Suarez-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 856 

CTTATTATGG AGCTGATCAA CAACGTCGCG AAGAAGCACG GTGGTTTCTC 50 

CGTGTTCGCC GGCGTCGGCG AGCGCACCAG GGAGGGCAAC GAGCTGTACC 100 

ACGAGATGAT GGAGACCGGC GTCATCAAGC GCCGCCAGCT GGATGACGGC 150 

ACGTTCGACT TCTCCGGCTC CAAGGCCGCG CTGGTGTACG GCCAGATGAA 200 

CGAGCCGCCA GGTGCCAGGG CGCGTGTTGC CCTCACTGGC CTGACGGTGG 250 

CCGAGTACTT CCGTGATGAG GACGGCCAGG ACGTGCTGCT CTTCATCGAC 300 

AACATCTACC GTTTCACCCA GGCTGGTTCT GAGGTGAGTG CCCTTTTGGG 350 

GCGCATCCCG TCCGCCGTCG GTTACCAGCC GACCCTCGCC ACCGACCTTG 400 

GCGCGCTGCA GGAGCGTATC ACGACGACCA ACAAGGGCTC CATCACCTCC 450 

GTGCAGGCCG TCTACGTGCC GGCCGACGAT ATCACCGACC CGGCGCCTGC 500 

GACCACCTTC ACCCATCTGG ACGCGACCAC TGTGCTCTCC CGTTCCATCG 550 

CCGAGCTGGG TATCTACCCC GCCGTCGACC CGCTCGACTC CACCTCGCGT 600 

ATGCTGTCCG CGAACATCGT CGGCGAGGAG CAGTACAACG TGGCGCGTGG 650 

CGTGCAGAAA ATACTGCAGG ACTACAAATC GCTGCAGGAT ATCATCGCCA 700 

TCCTGGGTAT GGACGAGCTG TCTGAGCAGG ACAAGTTCGT CGTCGCGCGT 750 

GCGCGCAAGG TTCAGCGTTT CCTATCCCAG CCCTTCCAGG TGGCTGAGGT 800 

ATTCACCGGC AAGCCCGGAC GTTTCGTCGA GCTGCAGGAC ACCATCAGCG 850 

GCGTCAAGGA GATTTTGGAC GGCGAGTGCG ACGACATG 888 



2) INFORMATION FOR SEQ ID NO: 857 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 857 

TGATTATGGA ATTGATCAAC AATGTCGCCA AGAAACACGG TGGGTTCTCC 50 

GTGTTCGCTG GTGTTGGTGA ACGTACGAGG GAAGGTAACG AACTGTACCA 100 

TGAAATGATG GAAACGGGTG TCATCAAGCG CCGTCAACTG GAAGACGGAA 150 

CATTTGACTT CTCGGGCTCT AAAGCTGCTT TGGTGTACGG ACAAATGAAC 200 

GAACCACCAG GTGCTAGAGC CCGTGTTGCA CTCACGGGAT TGACCGTTGC 250 

CGAGTATTTC CGTGATGAAG AGGGGCAGGA TGTGCTACTC TTCATCGATA 300 

ACATCTACCG TTTCACCCAG GCCGGTTCCG AAGTGAGTGC GCTGTTAGGA 350 

AGAATTCCAT CCGCCGTGGG TTATCAACCT ACATTGGCCA CTGATCTCGG 400 

AGCACTCCAG GAACGCATTA CTACAACCAA CAAGGGTTCG ATTACATCAG 450 

TCCAGGCAGT ATACGTCCCA GCCGATGATA TCACTGATCC CGCTCCAGCT 500 

ACCACTTTCT CGCACTTGGA TGCCACTACA GTGCTTTCTC GTTCAATTGC 550 

GGAGTTGGGT ATTTACCCTG CGGTCGACCC GCTTGACTCA ACGTCACGTA 600 

TGCTGTCGGC CAACATTGTA GGACAGGAAC AGTACGATGC CGCACGTGGT 650 

GTACAGAAAA TTTTACAGGA CTACAAATCA CTGCAGGATA TCATTGCCAT 700 

TCTGGGTATG GACGAGCTGT CTGAGCAGGA CAAGTTCGTT GTAGCACGCG 750 

CCCGTAAGGT ACAGCGTTTC CTGTCTCAGC CGTTCCAAGT GGCTGAGGTG 800 

TTCACCGGCA AGCCTGGGAG GTTCGTTGAA CTACAGGATA CCATCAGCGG 850 

TGTCAAGGAA ATCTGGAAGG TGAGTGTGAC GATA 884 



2) INFORMATION FOR SEQ ID NO: 858 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia microti 

(B) STRAIN: Persing-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 858 

TGGAACTGAT TAATAATGTG GCCAAAAAGC ATGGCGGTTA CTCTGTTTTT 50 

GCAGGTGTAG GTGAAAGGAC GAGGGAGGGT AATGAATTGT ACCATGAAAT 100 

GATGGAGACA GGTGTTATAA AGAAAAAGGC ACTAGGTGGT GGGAAGTTTG 150 

ATTTCAGTGG ATCTAAAGCA GCGCTGGTCT ATGGACAAAT GAACGAGCCA 200 

CCTGGGGCCC GTGCTAGAGT GGCACTAACT GGATTAACAG TCGCAGAATA 250 

TTTCCGTGAC GAACAAGGAC AAGACGTGTT GTTGTTTATT GATAATATTT 300 

ACCGATTTAC TCAGGCAGGG TCTGAGGTTT CAGCCTTGCT AGGCCGTATA 350 

CCTTCAGCTG TGGGATACCA GCCTACATTG GCAACAGATC TTGGCTGTTT 400 

ACAAGAACGA ATTACTACGA CCAAATCTGG TTCAATCACC AGTGTACAAG 450 

CTGTGTATGT GCCAGCAGAT GATATTACTG ATCCAGCGCC TGCCACAACT 500 

TTTACTCACT TGGACGCTAC TACTGTACTT AGCAGGCCAA TTGCTGAACT 550 

CGGTATTTAT CCAGCGGTAG ACCCGTTGGA TTCAACAAGC CGTATGCTAA 600 

GCGCGAACAT TGTGGGAAAT GAACACTATA GTGTAGCCCG TTCCGTGCAG 650 

AAGATACTGC AAGATTACAA ATCGCTTCAG GACATTATTG CCATTTTGGG 700 

TATGGATGAA CTGTCGGAAC AAGACAAAAA TATAGTAGCC CGAGCAAGGA 750 

AGATGCAAAG GTTCTTATCA CAGCCATTCC AAGTGGCGGA AGTTTTTACT 800 

GGTAAACCGG GAAGATTTGT GGAATTGGAA GATACAATTG CCGGGGCACG 850 

AGATATAATT GCGGGTAATT G 871 



2) INFORMATION FOR SEQ ID NO: 859 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1255 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania guyanensis 

(B) STRAIN: ATCC 50126 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 859 

TGACGGCGCT GGACGTGACG GAGGACCTCG GCCGCGATGA GCCGCTGACG 50 

CTGGAGATCG TGCAGCACTT GGATGCGAAC ACCGGCCGCT GCATTGCGAT 100 

GCAGACGACG GACCTGCTGA AGCTGAAGTC GAAGGTTGTG TCGACCGGCG 150 

GCAACATCTC CGTGCCGGTG GGCCGCGAGA CACTGGGCCG CATCTTCAAC 200 

GTGCTGGGCG ACGCGATTGA CCACCGCGGC CCCGTGTGCG AGAAGATGCG 250 

CATGGCGATC CACGCCGAGG CGCCGAAGCT GGCGGACCAG GCTGCGGAGG 300 

ACACGATCCT GACGACCGGC ATCAAGGTGA TCGACCTGAT TCTGCCCTAC 350 

TGCAAGGGCG GCAAGATCGG CCTGTTCGGC GGTGCCGGTG TGGGCAAGAC 400 

TGTGATCATC ATGGAGCTGA TCAACAACGT CGCGAAGGGG CACGGCGGCT 450 

TCTCCGTGTT CGCCGGCGTT GGCGAGCGCA CGCGCGAGGG CACGGACCTG 500 

TACCTGGAGA TGATGCAGTC AAAGGTGATT GACCTGAAGG GCGAGTCGAA 550 

GTGCGTGCTT GTGTACGGGC AGATGAACGA GCCCCCGGGT GCGCGCGCGC 600 

GCGTTGCGCA GTCTGCGCTG ACGATGGCCG AGTACTTCCG CGACGTGGAG 650 

GGCCAGAACG TGCTGCTGTT CATCGACAAC ATCTTCCGCT TCACGCAGGC 700 

GAACTCCGAG GTGTCTGCGC TGCTGGGCCG CATCCCGGCC GCCGTGGGTT 750 

ACCAGCCGAC GCTTGCGGAG GATCTTGGCA TGCTGCAGGA GCGCATTACG 800 

TCGACGACGA AGGGATCGAT TACGTCTGTG CAGGCTGTGT ACGTGCCTGC 850 

GGATGATATC ACGGACCCCG CGCCCGCGAC GACGTTCTCG CACCTGGACG 900 

CGACGACTGT GCTGGACCGC GCGGTGGCGG AGTCGGGCAT CTACCCTGCC 950 

GTGAACCCGC TGGAGTGCGC GTCGCGCATC ATGGACCCCG ATGTGATCGA 1000 

CGTGGACCAT TACAACGTTG CACAGGATAT CGTCCAGATG CTGACCAAGT 1050 

ACAAGGAGCT GCAGGACATC ATTGCGGTGC TTGGCATCGA CGAGCTGAGC 1100 

GAGGAGGACA AGGTCGTGGT GGACCGCGCG CGCAAGGTGA CGCGGTTCCT 1150 

GTCGCAGCCG TTCCAGGTTG CGGAGGTGTT CACCGGCATG ACGGGCCACT 1200 

ACGTGCAGCT GAGCGACACG GTGGAGTCGT TCTCTGGCCT GCTGATGGGG 1250 

TCGTA 1255 



2) INFORMATION FOR SEQ ID NO: 860 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1222 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 

(B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 860 

CTCGGAGGGC GTCCCGCCCG TGCTGACGGC GCTGGATGTG ACGGAGGACC 50 

TTGGCCGCGA TGAGCCGCTG ACGCTGGAGA TCGTGCAGCA CCTGGACGCG 100 

AACACCGGCC GCTGCATTGC GATGCAGACG ACGGACCTGC TGAAGCTGAA 150 

GTCGAAGGTT GTGTCGACCG GCGGCAACAT CTCTGTGCCG GTGGGCCGTG 200 

AGACGCTGGG CCGCATCTTC AACGTGCTGG GCGACGCGAT CGACCAGCGC 250 

GGCCCCGTGG GTGAGAAGAT GCGCATGGCG ATCCACGCCG AGGCCCCGAA 300 

GCTGGCGGAT CAGGCCGCGG AGGACACGAT CCTGACGACC GGCATCAAGG 350 

TGATCGACCT GATTCTGCCC TACTGCAAGG GTGGCAAGAT CGGCCTGTTT 400 

GGCGGCGCCG GTGTGGGCAA GACCGTGATC ATCATGGAGC TGATTAACAA 450 

CGTCGCGAAG GGCCACGGTG GTTTCTCGGT GTTTGCCGGC GTTGGCGAGC 500 

GCACGCGCGA GGGCACGGAC CTGTACCTGG AGATGATGCA GTCGAAGGTG 550 

ATTGACCTGA AGGGCGAGTC GAAGTGCGTG CTTGTGTACG GGCAGATGAA 600 

CGAGCCCCCG GGTGCGCGCG CGCGCGTTGC GCAGTCTGCG CTGACGATGG 650 

CGGAGTACTT CCGAGACGTG GAGGGCCAGA ATGTGCTGCT GTTCATCGAC 700 

AACATCTTCC GCTTCACGCA GGCGAACTCC GAGGTCTCTG CGCTGCTGGG 750 

CCGCATTCCG GCCGCCGTGG GCTACCAGCC GACGCTTGCG GAGGATCTTG 800 

GTATGCTGCA GGAGCGCATC ACGTCGACGA CGAAGGGGTC GATCACGTCC 850 

GTGCAGGCCG TGTACGTGCC TGCGGATGAT ATCACGGATC CGGCGCCCGC 900 

GACGACGTTC TCGCACCTGG ACGCGACGAC TGTGCTGGAC CGCGCGGTGG 950 

CGGAGTCGGG GATCTACCCT GCCGTGAACC CGCTGGAGTG CGCGTCGCGT 1000 

ATCATGGACC CCGATGTGAT CGACGTGGAC CACTACAACG TTGCGCAGGA 1050 

TATCGTGCAG ATGCTGACCA AGTACAAGGA GCTGCAGGAT ATCATTGCGG 1100 

TGCTTGGTAT CGACGAGCTG AGCGAGGAGG ACAAGGTCGT GGTGGACCGC 1150 

GCGCGCAAGG TGACCCGGTT CCTGTCGCAG CCGTTCCAGG TTGCGGAGGT 1200 

GTTCACGGGC ATGACGGGCC AC 1222 



2) INFORMATION FOR SEQ ID NO: 861 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1246 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 50129 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 861 

CCGTGCTGAC GGCGCTGGAT GTGACGGAGG ACCTTGGCCG CGATGAGCCG 50 

CTGACGCTGG AGATCGTGCA GCACTTGGAC GCGAACACCG GCCGCTGCAT 100 

TGCGATGCAG ACGACGGACC TGCTGAAGCT GAAGTCGAAG GTTGTGTCGA 150 

CCGGCGGCAA CATCTCTGTG CCGGTGGGCC GTGAGACGCT GGGCCGCATC 200 

TTCAACGTTC TGGGCGACGC GATCGACCAG CGCGGCCCCG TGGGCGAGAA 250 

GATGCGCATG GCGATCCACG CCGAGGCCCC GAAGCTGGCG GATCAGGCCG 300 

CGGAGGACAC GATCCTGACG ACCGGCATCA AGGTGATCGA CCTGATTCTG 350 

CCCTACTGCA AGGGTGGCAA GATCGGCCTG TTCGGCGGTG CCGGTGTGGG 400 

CAAGACTGTG ATCATCATGG AGCTGATCAA CAACGTCGCG AAGGGCCACG 450 

GTGGTTTCTC CGTGTTTGCC GGCGTTGGCG AGCGCACGCG CGAGGGCACG 500 

GACCTGTACC TGGAGATGAT GCAGTCGAAG GTGATTGACC TGAAGGGCGA 550 

GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT GAACGAGCCC CCGGGTGCGC 600 

GCGCGCGCGT TGCGCAGTCT GCGCTGACGA TGGCGGAGTA CTTCCGCGAC 650 

GTGGAGGGCC AGAACGTGCT GCTGTTCATC GACAACATCT TCCGCTTCAC 700 

GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT GGGCCGCATT CCGGCCGCCG 750 

TGGGCTACCA GCCGACGCTT GCGGAGGATC TTGGTATGCT GCAGGAGCGC 800 

ATCACGTCGA CAACGAAGGG GTCGATCACG TCCGTGCAGG CCGTGTACGT 850 

GCCAGCGGAT GATATCACGG ATCCCGCGCC CGCGACGACG TTCTCGCACC 900 

TGGACGCGAC GACTGTGCTG GACCGCGCGG TGGCGGAGTC GGGCATCTAC 950 

CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG CGTATCATGG ACCCCGATGT 1000 

GATCGATGTG GACCACTACA ACGTTGCGCA GGATATCGTG CAGATGCTGA 1050 

CCAAGTACAA GGAGCTGCAG GATATCATTG CGGTGCTTGG CATCGACGAG 1100 

CTGAGCGAGG AAGACAAGGT TGTTGTGGAC CGCGCGCGCA AGGTGACCCG 1150 

GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA GGTGTTCACG GGCATGACGG 1200 

GCCACTACGT GCAGCTGGTC GACACGGTGG AGTCGTTCTC TGGCCT 1246 



2) INFORMATION FOR SEQ ID NO: 862 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1265 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30815 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 862 

GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG ACCTTGGCCG 50 

CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC GCGAACACGG 100 

GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT GAAGTCGAAG 150 

GTCGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC GTGAGACGCT 200 

GGGCCGCATC TTCAAYGTTC TGGGCGACGC GATCGACCAG CGCGGCCCCG 250 

TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC GAAGCTGGCG 300 

GATCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA AGGTGATCGA 350 

CCTGATTCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG TTCGGCGGTG 400 

CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA CAACGTCGCG 450 

AAGGGCCACG GCGGTTTCTC CGTGTTTGCC GGCGTTGGCG AGCGCACGCG 500 

CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG GTGATTGACC 550 

TGAAGGGCGA GTCGAAGTGY GTGCTTGTGT ATGGGCAGAT GAACGAGCCC 600 

CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA TGGCGGAGTA 650 

CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC GACAACATCT 700 

TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT GGGCCGCATT 750 

CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC TTGGTATGCT 800 

GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG TCCGTGCAGG 850 

CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC CGCGACGACG 900 

TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG TGGCGGAGTC 950 

GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG CGTATCATGG 1000 

ACCCTGATGT GATCGATGTG GACCACTACA ACGTTGCGCA GGATATCGTG 1050 

CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG CGGTGCTTGG 1100 

CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTTGTGGAC CGCGCGCGCA 1150 

AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA GGTGTTCACG 1200 

GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG AGTCGTTCTC 1250 

TGGCCTGCTG ATGGG 1265 



2) INFORMATION FOR SEQ ID NO: 863 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1191 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 863 

ATGGCAAAAG GCAAGTTTGA ACGTACCAAG CCGCACGTGA ACGTGGGTAC 50 

GATTGGTCAC GTTGACCACG GCAAAACGAC GTTGACGGCG GCGATCACGA 100 

CGGTGCTGTC GAACAAGTTC GGCGGCGAGG CTCGCGGCTA CGACCAGATT 150 

GACGCGGCGC CGGAAGAGAA GGCGCGTGGG ATCACGATCA ACACCTCGCA 200 

CGTTGAGTAC GAGACGGAGA CGCGTCACTA CGCGCACGTT GATTGCCCGG 250 

GTCACGCTGA CTACGTGAAG AACATGATCA CGGGTGCTGC GCAGATGGAC 300 

GGCGCGATCC TGGTGGTGTC GGCCGCAGAC GGCCCGATGC CGCAGACGCG 350 

CGAGCACATT TTGCTGTCGC GCCAGGTTGG CGTGCCGTAC ATCATCGTGT 400 

TCCTGAACAA GGCGGACATG GTTGATGACG CGGAGCTGCT CGAGCTGGTG 450 

GAGATGGAAG TCCGCGAACT GCTGAGCAAG TACGATTTCC CGGGCGATGA 500 

CACGCCGATC GTGAATGGTG CGGCCAAGCT GGCGCTGGAA AGCGACAACG 550 

GCGACCTGGG CGAGCAGGCG ATTCTGTCGC TGGCGCAAGC GCTGGACACG 600 

TACATTCCGA CGCCGGAGCG CGCGGTCGAC GGTGCGTTCC TGATGCCGGT 650 

GGAAGACGTG TTCTCGATCT CGGGCCGTGG CACGGTGGTG ACTGGCCGTA 700 

TCGAGCGCGG CGTGGTGAAG GTTGGCGAGG AAATCGAAAT CGTGGGCATC 750 

AAGCCGACGG TGAAGACGAC CTGCACGGGC GTGGAGATGT TCCGCAAGCT 800 

GCTGGACCAG GGCCAGGCGG GCGACAACGT GGGTATCTTG CTGCGCGGCA 850 

CCAAGCGTGA AGACGTCGAG CGTGGCCAGG TGCTGGCCAA GCCGGGTTCG 900 

ATCAACCCGC ACACGGACTT CACGGCCGAG GTGTACATTC TGTCCAAGGA 950 

AGAGGGTGGC CGTCACACGC CGTTCTTCAA CGGCTATCGT CCGCAGTTCT 1000 

ACTTCCGCAC GACGGACGTG ACCGGCACGA TCGACCTGCC GGCGGACAAG 1050 

GAAATGGTGC TGCCGGGCGA CAACGTGTCG ATGACCGTCA AGCTGCTGGC 1100 

CCCGATCGCC ATGGAAGAAG GTCTGCGTTT CGCCATCCGT GAAGGCGGTC 1150 

GTACCGTCGG TGCCGGCGTC GTCGCCAAGA TCATCAAGTA A 1191 



2) INFORMATION FOR SEQ ID NO: 864 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(B) STRAIN: LVH/75/USAMRU-K/18 

(C) ACCESSION NUMBER: extracted from U10562 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 864 



ATGGGAAAGG AAAAGGTGCA CATGAATCTT GTGGTGGTGG GCCACGTCGA 50 

TGCCGGTAAA TCCACTGCAA CGGGTCACTT GATCTACAAG TGCGGTGGTA 100 

TTGACAAACG TACGATCGAG AAGTTCGAGA AAGAAGCTGC CGACATTGGT 150 

AAGGCCTCAT TCAAGTACGC ATGGGTGCTG GACAAGCTGA AGGCTGAGCG 200 

CGAACGTGGT ATCACGATCG ACATTGCACT GTGGAAATTC GAGTCACCCA 250 

AGTCTGTCTT CACTATTATT GATGCTCCTG GGCACCGTGA CTTCATCAAG 300 

AACATGATCA CCGGCACATC GCAAGCCGAC GCAGCCATCC TCATCATTGC 350 

CTCTGCGCAG GGTGAGTTCG AGGCTGGTAT CTCCAAGGAT GGACAGACCC 400 

GCGAGCACGC GTTGCTGGCC TTCACTTTGG GTGTGAAGCA GATGGTTGTG 450 

TGCTGCAACA AAATGGACGA CAAGACTGTG AACTACGGAC AGGAGCGGTA 500 

TGACGAGATT GTGAAGGAGG TGTCTGCTTA CATCAAGAAG GTTGGGTACA 550 

ACGTGGAGAA GGTGCGCTTC GTCCCCATCT CCGGATGGCA GGGCGACAAC 600 

ATGATTGAGA AATCCGAGAA GATGCCATGG TACAAGGGTC CAACGCTCCT 650 

GGAGGCACTA GACATGCTGG AGCCACCAGT GCGTCCGAGC GACAAGCCCC 700 

TGCGTCTGCC ACTGCAGACG TGTACAAAGA TCGGTGGTAT TGGCACCGTG 750 

CCCGTTGGTC GTGTGGAGAC CGGCGTGATG AAGCCTGGTG ATGTGGYGAC 800 

GTTTGCCCCC GCCAACGTGA CGACCGAGGT GAAATCGATC GAGATGCACC 850 

ACGAGCAGCT CGCTGAGGCG ACCCCCGGTG ACAACGTCGG CTTTAACGTG 900 

AAGAACGTTT CTGTAAAGGA CATCCGCCGT GGCAACGTCT GCGGTAACAC 950 

CAAGAACGAC CCCCCAAAGG AGGCCGCCGA CTTCACGGCA CAGGTGATCA 1000 

TCCTGAACCA CCCCGGACAG ATTGGAAACG GTTATGCGCC CGTGCTGGAC 1050 

TGCCACACAT CGCACATTGC CTGCAAGTTC GCGGAGATCG AGTCGAAGAT 1100 

CGACCGTCGC TCTGGCAAGG AGCTGGAGAA GGCTCCCAAG TCGATCAAGT 1150 

CTGGCGACGC CGCGATCGTG CGCATGGTGC CGCAGAAGCC TATGTGCGTG 1200 

GAGGTCTTCA ACGACTACGC GCCACTCGGC CGCTTTGCCG TGCGTGACAT 1250 

GCGCCAGACC GTCGCTGTCG GTATCATCAA GGCCGTGACC AAGAAGGACG 1300 

GTTCTGGTGG TAAGGTGACG AAGGCTGCGG TGAAGGCTTC GAAGAAATAA 13 50 



2) INFORMATION FOR SEQ ID NO: 865 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1052 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 865 



AAGCTCAAGG CTGAGAGAGA AAGAGGTATT ACCATCGATA TTGCTTTATG 50 

GCAATTCGAA ACCCCAAAAT ACCACTACAC TGTCATTGAT GCCCCAGGTC 100 

ACAGAGATTT CATCAAGAAT ATGATTACTG GTACCTCTCA AGCTGATGTT 150 

GCTTTATTGG TTGTCCCAGC CGATCGTTTC GAAGGTGCCT TCTCCAAGGA 200

AGGTCAAACC AGAGAACATG CTTTATTGGC CTTCACTTTG GGTGTCAGAC 250

AAATGATTGT CGGTATTAAC AAGATGGATA CCTGTGAATA CAAGCAATCT 300 

CGTTTTGATG AAATCTTCAA CGAAGTTGAT GGTTACCTCA AGAAGGTTGG 350 

TTACAACACC GAGAAGATCC CATTCGTTGC CATTTCTGGT TTCGTTGGTG 400 

ATAATATGGT TGAGAGATCT GACAAGATGC CATGGTATAA GGGTAAGACC 450 

TTAGTCGAAG CCCTCGACAC TATGGAACCA CCAAAGAGAC CAACTGACAA 500

GCCACTCCGT CTCCCATTAC AAGATGTTTA CAAGATAGGT GGTGTAGGTA 550 

CTGTCCCAGT CGGTCGTGTT GAGACTGGTA TCATCAGACC AGGTATGAAT 600 

GTTACCTTCG CTCCAGCTGG TGTTACCACT GAAGTTAAGT CAGTAGAAAT 650 

GCACCATGAG CAGATGCCAG AGGCCGTCCC AGGTGACAAC GTTGGTTTCA 700 

ATGTTAAGAA CGTCTCCATC AAGGATATCA AGAGAGGTTT CGTTGCTTCT 750

GATGCCAAGA ATGACCCAGC TAAGGGCTGT GAAGACTTCA CTGCTCAAGT 800 

TATCGTCCTC AACCACCCAG GTGAAATCAA GAACGGTTAC TCTCCAGTCG 850 

TTGACTGTCA CACCGCTCAC ATTTCCTGCA AATTCCAGAC TATCACTGCT 900 

AAGATGGACA AGAGATCTGG TAAGGTTTTG GAAGAAAACC CAAAGCTTAT 950 

CAAGTCTGGT GATGCTGCTT TGGTTGTTAT GCAACCTTTG AAGCCACTTT 1000

GTGTTGAGGC CTTCACTGAC TACCCACCTC TAGGTCGTTT CGCTGTCCGT 1050 

GA 1052 




2) INFORMATION FOR SEQ ID NO: 866 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 837 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 35552 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 866 

CAATGAAGTT CCAGAAATTA ACAATGCCTT AGTCGTAGAC GTTGAAAGAG 50 

ATGAAGGTAC AGTATCTCTT ACATTAGAAG TGGCATTACA ACTTGGCGAT 100 

GATGTCGTAC GTACAATTGC AATGGATTCT ACTGATGGTG TTAAACGTGG 150 

TACAGAAGTT CGAGATAGCG GAGATAGCAT CAGTGTTCCA GTTGGTGATG 200

CTACGTTAGG ACGTGTGTTT AATGTTCTTG GTGATACAAT TGACTTAGAC 250 

GAGAAGCTTG ATACTTCTGT CAAACGTGAT CCAATTCATA GAGAAGCACC 300 

TGCATTCGAT CAATTATCAA CAAAAGTTGA AATCTTAGAA ACAGGTATTA 350 

AAGTAATTGA TTTACTTGCA CCATATATTA AAGGTGGTAA AATCGGTTTA 400

TTCGGTGGCG CTGGTGTAGG TAAAACAGTA TTAATTCAAG AATTAATTAA 450

TAATATAGCT CAAGAACATG GTGGTATTTC AGTATTTGCC GGCGTAGGTG 500 

AACGTACGCG TGAAGGTAAT GACTTATACT ACGAAATGAG TGATAGTGGT 550 

GTTATTAAGA AAACAGCTAT GGTCTTCGGA CAAATGAATG AGCCACCTGG 600 

TGCGCGTATG CGTGTTGCTT TATCAGGCTT AACAATGGCT GAACACTTCC 650 

GTGATGTACA AGGACAAGAT GTTTTACTAT TTATTGATAA CATATTCAGA 700 

TTTACGCAAG CTGGTTCAGA AGTATCAGCA CTATTAGGTC GTATGCCATC 750 

AGCCGTTGGT TATCAACCTA CCCTTGCTAC TGAAATGGGT CAATTACAAG 800 

AACGTATTAC ATCAACAACT AAAGGATCTG TAACGTC 837 



2) INFORMATION FOR SEQ ID NO: 867 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 867

AAGGTATTCG ATGCCTTGAA AATGGAAGGC TCCGAGCTGA CCCTGGAAGT 50
ACAACAGCAG CTGGGCGACG GCATTGTCCG TACCATTGCA CTGGGTACCT 100
CCGACGGCCT GCGTCGCGGC ATGATGATCC AGAACACCGG CAAACCTATC 150
ATGGTGCCAG TCGGTAAAGC AACCCTGGGT CGCATCATGG ACGTGCTGGG 200
TAACCCGATC GACGAATGCG GCGCGGTCGC TCACGACCAG ATCGCTTCGA 250
TCCACCGCGC TCCTCCTGCG TACGACGAAC TGTCGCCATC GCAAGATCTG 300
CTGGAAACCG GCATTAAAGT TATTGACCTG GTGTGCCCGT TCGCCAAGGG 350
CGGTAAAGTC GGTCTGTTCG GCGGTGCAGG TGTGGGCAAG ACCGTGAACA 400
TGATGGAACT GATCAACAAC ATCGCCAAAG CACACTCGGG TCTGTCCGTG 450
TTTGCCGGTG TGGGTGAGCG TACCCGTGAA GGTAACGACT TCTACCACGA 500
GATGGCTGAC GCCAAAGTGG TCGATCTGGA AAATCCAGAG AACTCCAAGG 550
TTGCGATGGT CTACGGTCAG ATGAATGAAC CACCAGGCAA CCGTCTGCGC 600
GTGGCGCTGA CCGGTCTGAC CATGGCTGAA GCATTCCGTG ACGAAGGCAA 650
AGACGTTCTG TTCTTCGTGG ACAACATCTA CCTgCTTCACC CTGGCCGti¥A~ 700
CCGAAGTATC GGCACTGCTG GGCCGTATGC CATCGGCTGT GGGTTACCAG 750
CCTACGCTGG CCGAAGAAAT GGGTCGCCTG CAAGAGCGCA TCACTTCGAC 800
CAAGACCGGT TCGATCAC 818



2) INFORMATION FOR SEQ ID NO: 868 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus

(B) STRAIN: ATCC 43867

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 868 

CTATCTTAGT AGTATCTGCT GCTGATGGCC CAATGCCACA AACTCGTGAA 50 

CACATTCTTT TATCACGTAA CGTTGGTGTT CCAGCATTAG TTGTATTCTT 100

AAACAAAGTT GACATGGTTG ACGATGAAGA ATTATTAGAA TTAGTAGAAA 150 

TGGAAGTTCG TGACTTATTA AGCGAATATG ACTTCCCAGG TGACGATGTA 200 

CCTGTAATCT CTGGTTCTGC ATTAAAAGCT TTAGAAGGCG ACGCTGACTA 250 

TGAGCAAAAA ATCTTAGACT TAATGCAAGC TGTTGATGAC TTCATTCCAA 300 

CACCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT TGAGGACGTA 350

TTCTCAATCA CTGGTCGTGG TACTGTTGCT ACAGGCCGTG TTGAACGTGG 400

TCAAATCAAA GTCGGTGAAG AAATCGAAAT CATCGGTATG CAAGAAGAAT 450 

CAAGCAAAAC AACTGTTACT GGTGTAGAAA TGTTCCGTAA ATTATTAGAC 500 

TACGCTGAAG CTGGTGACAA CATTGGTGCA TTATTACGTG GTGTTTCACG 550

TGATGACGTA CAACGTGGTC AAGTTTTAGC TGCTCCTGGT ACTATTACAC 600

CACATACAAA ATTCAAAGCG GATGTTTACG TTTTATCTAA AGATGAAGGT 650 

GGTCGTCATA CACCATTCTT CACTAACTAC CGCCCACAAT TCTATTTCCG 700 

TACTACTGAC GTAACTGGTG TTGTTAACTT ACCAGAAGGT ACTGAAATGG 750 

TTATGCCTGG CGATAACGTT GAAATGGA 778 




2) INFORMATION FOR SEQ ID NO: 869 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 640 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus

(B) STRAIN: R689

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 869 

TGGTCCTATG CCTCAAACAC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 
GTGTACCATA CATCGTTGTT TTCTTAAACA AAATGGATAT GGTTGATGAC 100
GAAGAATTAC TAGAATTAGT TGAAATGGAA GTTCGTGACT TATTGTCAGA 150
ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCTGGT TCTGCTTTGA 200
AAGCTCTTGA AGGCGATGCT TCATACGAAG AAAAAATCAT GGAATTAATG 250
GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGACA CTGACAAACC 300
ATTCATGATG CCAGTCGAAG ACGTATTCTC AATCACTGGA CGTGGTACTG 350
TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGACGAAGTT 400
GAAATCGTTG GTATTGCTGA AGAAACTGCT AAAACAACTG TAACTGGTGT 450
TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500
GTGCATTGCT ACGTGGTGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550
TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTTA AAGCTGAAGT 600
TTACGTTTTA ACAAAAGAAG AAGGTGGACG TCACACTCCA 640



2) INFORMATION FOR SEQ ID NO: 870

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 644 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R754 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 870 

GTCCTATGCC TCAAACACGT GAACACATCT TGTTATCACG TAACGTTGGT 50 

GTACCATACA TCGTTGTTTT CTTAAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTACTA GAATTAGTTG AAATGGAAGT TCGTGACTTA TTGTCAGAAT 150 

ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCTGGTTC TGCTTTGAAA 200 

GCTCTTGAAG GCGATGCTTC ATACGAAGAA AAAATCATGG AATTAATGGC 250 

TGCAGTTGAC GAATACGTTC CAACTCCAGA ACGTGACACT GACAAACCAT 300 

TCATGATGCC AGTCGAAGAC GTATTCTCAA TCACTGGACG TGGTACTGTT 350 

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ACGAAGTTGA 400

AATCGTTGGT ATTGCTGAAG AAACTGCTAA AACAACTGTA ACTGGTGTTG 450

AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 500 

GCATTGCTAC GTGGTGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 550 

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTTAAA GCTGAAGTTT 600 

ACGTTTTAAC AAAAGAAGAA GGTGGACGTC ACACACCATT CTTC 644 



2) INFORMATION FOR SEQ ID NO: 871 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 637 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: R758

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 871

TCCTATGCCT CAAACACGTG AACACATCTT GTTATCACGT AACGTTGGTG 50
TACCATACAT CGTTGTTTTC TTAAACAAAA TGGATATGGT TGATGACGAA 100
GAATTACTAG AATTAGTTGA AATGGAAGTT CGTGACTTAT TGTCAGAATA 150
TGACTTCCCA GGCGACGATG TTCCTGTAAT CGCTGGTTCT GCTTTGAAAG 200
CTCTTGAAGG CGATGCTTCA TACGAAGAAA AAATCATGGA ATTAATGGCT 250
GCAGTTGACG AATACGTTCC AACTCCAGAA CGTGACACTG ACAAACCATT 300
CATGATGCCA GTCGAAGACG TATTCTCAAT CACTGGACGT GGTACTGTTG 350
CTACAGGCCG TGTTGAACGT GGACAAGTTC GCGTTGGTGA CGAAGTTGAA 400
ATCGTTGGTA TTGCTGAAGA AACTGCTAAA ACAACTGTAA CTGGTGTTGA 450
AATGTTCCGT AAATTGTTAG ACTATGCTGA AGCAGGGGAT AACATTGGTG 500
CATTGCTACG TGGGGTTGCT CGTGAAGACA TCCAACGTGG ACAAGTATTA 550
GCTAAAGCTG GTACAATCAC ACCTCATACA AAATTTAAAG CTGAAGTTTA 600
CGTTTTAACA AAAGAAGAAG GTGGACGTCA CACTCCA 637



2) INFORMATION FOR SEQ ID NO: 872 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Enterococcus gallinarum

(B) STRAIN: R631 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 872 

GTCCTATGCC TCAAACTCGT GAACACATCT TGTTATCACG TAACGTTGGC 50

GTACCATACA TCGTTGTTTT CTTGAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTGCTA GAATTAGTTG AAATGGAAGT TCGTGACCTA TTGTCTGAAT 150 

ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCCGGTTC TGCTTTGAAA 200 

GCTCTTGAAG GAGATCCTTC ATACGAAGAA AAAATCATGG AATTGATGGC 250 

TGCAGTTGAC GAATACGTTC CAACTCCAGA ACGTGATACT GACAAACCAT 300

TCATGATGCC AGTCGAAGAC GTATTCTCAA TCACTGGACG TGGTACTGTT 350

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ATGAAGTAGA 400

AATCGTTGGT ATTGCTGACG AAACTGCTAA AACAACTGTA ACAGGTGTTG 450

AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 500

GCATTGCTAC GTGGGGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 550

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTCAAA GCTGAAGTTT 600

ATGTTTTGAC AAAAGAAGAA GGTGGACGTC ACACTCCATT CTT 643




2) INFORMATION FOR SEQ ID NO: 873 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 641 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R691 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 873

GTCCTATGCC TCAAACTCGT GAACACATCT TGTTATCACG TAACGTTGGC 50 

GTACCATACA TCGTTGTTTT CTTGAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTGCTA GAATTAGTTG AAATGGAAGT TCGTGACCTA TTGTCTGAAT 150 

ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCCGGTTC TGCTTTGAAA 200

GCTCTTGAAG GAGATCCTTC ATACGAAGAA AAAATCATGG AATTGATGGC 250

TGCAGTTGAC GAATACGTTC CAACTCCAGA ACGTGATACT GACAAACCAT 300

TCATGATGCC AGTCGAAGAC GTATTCTCAA TCACTGGACG TGGTACTGTT 350

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ATGAAGTAGA 400

AATCGTTGGT ATTGCTGACG AAACTGCTAA AACAACTGTA ACAGGTGTTG 450

AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 500 

GCATTGCTAC GTGGGGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 550 

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTCAAA GCTGAAGTTT 600 

ATGTTTTGAC AAAAGAAGAA GGTGGACGTC ACACTCCATT C 641



2) INFORMATION FOR SEQ ID NO: 874 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 681 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 
(B) STRAIN: LSPQ 2514

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 874 

ACCAGCATTA GTAGTATTCT TAAATAAAGT TGACATGGTT GACGATGAAG 50 

AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100

GACTTCCCAG GTGACGATGT ACCTGTAATC GCTGGTTCAG CATTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 200 

CAGTTGATGA CTACATTCCA ACTCCAGAAC GTGATTCTGA CAAACCATTC 250 

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 300 

TACAGGCCGT GTTGAACGTG GGCAAATCAA AGTTGGTGAA GAAGTTGAAA 350

TCATTGGTAT CCATGACACT TCTAAAACAA CTGTTACTGG TGTAGAAATG 400 

TTCCGTAAAT TATTAGACTA CGCTGAAGCT GGTGACAACA TCGGTGCATT 450 

ATTACGTGGT GTTGCTCGTG AAGACGTACA ACGTGGTCAA GTATTAGCTG 500 

CTCCAGGTTC AATCACACCT CACACAAAAT TTAAAGCAGA CGTATACGTT 550 

TTATCTAAAG ACGAAGGTGG ACGTCACACT CCATTCTTCA CAAACTATCG 600

TCCACAATTC TATTTCCGTA CTACTGACGT AACTGGTGTT GTTAACTTAC 650 

CAGAAGGTAC TGAAATGGTT ATGCCTGGCG A 681 




2) INFORMATION FOR SEQ ID NO: 875 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 675 bases 
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: R591 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 875 



ATTATCACGT AACGTTGGTG TACCAGCATT AGTTGTATTC TTAAACAAAG 50
TTGACATGGT AGACGACGAA GAATTATTAG AATTAGTTGA AATGGAAGTT 100
CGTGACTTAT TAAGCGAATA TGACTTCCCA GGTGACGATG TACCTGTAAT 150
CGCTGGTTCT GCATTAAAAG CATTAGAAGG CGATGCTGAA TACGAACAAA 200
AAATCTTAGA CTTAATGCAA GCAGTTGATG ATTACATTCC AACTCCAGAA 250
CGTGATTCTG ACAAACCATT CATGATGCCA GTTGAGGACG TATTCTCAAT 300
CACTGGTCGT GGTACTGTTG CTACAGGCCG TGTTGAACGT GGTCAAATCA 350
AAGTTGGTGA AGAAGTTGAA ATCATCGGTA TGCACGAAAC TTCTAAAACA 400
ACTGTTACTG GTGTAGAAAT GTTCCGTAAA TTATTAGACT ACGCTGAAGC 450
TGGTGACAAC ATCGGTGCTT TATTACGTGG TGTTGCACGT GAAGACGTAC 500
AACGTGGTCA AGTATTAGCT GCTCCTGGTT CTATTACACC ACACACAAAA 550
TTCAAAGCTG AAGTATACGT ATTATCTAAA GATGAAGGTG GACGTCACAC 600
TCCATTCTTC ACTAACTATC GCCCACAATT CTATTTCCGT ACTACTGACG 650
TAACTGGTGT TGTAAACTTA CCAGA 675



2) INFORMATION FOR SEQ ID NO: 876

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 704 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE:

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: CSG 10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 876 



TCTTATTATC ACGTAACGTT GGTGTACCAG CATTAGTTGT ATTCTTAAAC 50
AAAGTTGACA TGGTAGACGA CGAAGAATTA TTAGAATTAG TTGAAATGGA 100
AGTTCGTGAC TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG 150
TAATCGCTGG TTCTGCATTA AAAGCATTAG AAGGCGATGC TGAATACGAA 200
CAAAAAATCT TAGACTTAAT GCAAGCAGTT GATGATTACA TTCCAACTCC 250
AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 300
CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 350
ATCAAAGTTG GTGAAGAAGT TGAAATCATC GGTATGCACG AAACTTCTAA 400
AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 450
AAGCTGGTGA CAACATCGGT GCTTTATTAC GTGGTGTTGC ACGTGAAGAC 500
GTACAACGTG GTCAAGTATT AGCTGCTCCT GGTTCTATTA CACCACACAC 550
AAAATTCAAA GCTGAAGTAT ACGTATTATC TAAAGATGAA GGTGGACGTC 600
ACACTCCATT CTTCACTAAC TATCGCCCAC AATTCTATTT CCGTACTACT 650
GACGTAACTG GTGTTGTAAA CTTACCAGAA GGTACAGAAA TGGTTATGCC 700
TGGC 704



2) INFORMATION FOR SEQ ID NO: 877

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 770 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 35984

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 877 

TCTTAGTTGT ATCTGCTGCT GACGGTCCAA TGCCACAAAC TCGTGAACAC 50 

ATCTTATTAT CACGTAACGT TGGTGTACCA GCATTAGTTG TATTCTTAAA 100 

CAAAGTTGAC ATGGTAGACG ACGAAGAATT ATTAGAATTA GTTGAAATGG 150 

AAGTTCGTGA CTTATTAAGC GAATATGACT TCCCAGGTGA CGATGTACCT 200 

GTAATCGCTG GTTCTGCATT AAAAGCATTA GAAGGCGATG CTGAATACGA 250

ACAAAAAATC TTAGACTTAA TGCAAGCAGT TGATGATTAC ATTCCAACTC 300

CAGAACGTGA TTCTGACAAA CCATTCATGA TGCCAGTTGA GGACGTATTC 350 

TCAATCACTG GTCGTGGTAC TGTTGCTACA GGCCGTGTTG AACGTGGTCA 400 

AATCAAAGTT GGTGAAGAAG TTGAAATCAT CGGTATGCAC GAAACTTCTA 450 

AAACAACTGT TACTGGTGTA GAAATGTTCC GTAAATTATT AGACTACGCT 500 

GAAGCTGGTG ACAACATCGG TGCTTTATTA CGTGGTGTTG CACGTGAAGA 550 

CGTACAACGT GGTCAAGTAT TAGCTGCTCC TGGTTCTATT ACACCACACA 600 

CAAAATTCAA AGCTGAAGTA TACGTATTAT CTAAAGATGA AGGTGGACGT 650 

CACACTCCAT TCTTCACTAA CTATCGCCCA CAATTCTATT TCCGTACTAC 700 

TGACGTAACT GGTGTTGTAA ACTTACCAGA AGGTACAGAA ATGGTTATGC 750 

CTGGCGACAA CGTTGAAATG 770 



2) INFORMATION FOR SEQ ID NO: 878 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 716 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 35983 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 878 

TTGTATTCTT AAACAAAGTT GACATGGTAG ACGACGAAGA ATTATTAGAA 50

TTAGTTGAAA TGGAAGTTCG TGACTTATTA AGCGAATATG ACTTCCCAGG 100 

TGACGATGTA CCTGTAATCG CTGGTTCTGC ATTAAAAGCA TTAGAAGGCG 150 

ATGCTGAATA CGAACAAAAA ATCTTAGACT TAATGCAAGC AGTTGATGAT 200 

TACATTCCAA CTCCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT 250 

TGAGGACGTA TTCTCAATCA CTGGTCGTGG TACTGTTGCT ACAGGCCGTG 300 

TTGAACGTGG TCAAATCAAA GTTGGTGAAG AAGTTGAAAT CATCGGTATG 350

CACGAAACTT CTAAAACAAC TGTTACTGGT GTAGAAATGT TCCGTAAATT 400

ATTAGACTAC GCTGAAGCTG GTGACAACAT CGGTGCTTTA TTACGTGGTG 450

TTGCACGTGA AGACGTACAA CGTGGTCAAG TATTAGCTGC TCCTGGTTCT 500

ATTACACCAC ACACAAAATT CAAAGCTGAA GTATACGTAT TATCTAAAGA 550

TGAAGGTGGA CGTCACACTC CATTCTTCAC TAACTATCGC CCACAATTCT 600

ATTTCCGTAC TACTGACGTA ACTGGTGTTG TAAACTTACC AGAAGGTACA 650

GAAATGGTTA TGCCTGGCGA CAACGTTGAA ATGACAGTTG AATTAATCGC 700

TCCAATCGCT ATCGAA 716



2) INFORMATION FOR SEQ ID NO: 879 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R764 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 879 

CGGTCCTATG CCTCAAACTC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 

GCGTACCATA CATCGTTGTT TTCTTGAACA AAATGGATAT GGTTGATGAC 100

GAAGAATTGC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTGTCTGA 150 

ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCCGGT TCTGCTTTGA 200 

AAGCTCTTGA AGGAGATCCT TCATACGAAG AAAAAATCAT GGAATTGATG 250 

GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGATA CTGACAAACC 300

ATTCATGATG CCAGTCGAAG ACGTATTCTC AATCACTGGA CGTGGTACTG 350

TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGATGAAGTA 400

GAAATCGTTG GTATTGCTGA CGAAACTGCT AAAACAACTG TAACAGGTGT 450

TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

GTGCATTGCT ACGTGGGGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550 

TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTCA AAGCTGAAGT 600 

TTATGTTTTG ACAAAAGAAG AAGGTGGACG TCACACTCCA 640



2) INFORMATION FOR SEQ ID NO: 880 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: PAO-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 880 

CGGCGCGATC CTGGTTTGCT CGGCTGCCGA CGGCCCCATG CCGCAGACCC 50
GCGAGCACAT CCTGCTGTCC CGCCAGGTAG GCGTTCCCTA CATCGTCGTG 100
TTCCTGAACA AGGCCGACAT GGTCGACGAC GCCGAGCTGC TGGAACTGGT 150
CGAGATGGAA GTTCGCGATC TGCTGAACAC CTACGACTTC CCGGGCGACG 200
ACACTCCGAT CATCATCGGT TCCGCGCTGA TGGCGCTGGA AGGCAAGGAT 250
GACAACGGCA TCGGCGTAAG CGCCGTGCAG AAGCTGGTAG AGACCCTGGA 300
CTCCTACATT CCGGAGCCGG TTCGTGCCAT CGACCAGCCG TTCCTGATGC 350
CGATCGAAGA CGTGTTCTCG ATCTCCGGCC GCGGTACCGT GGTAACCGGT 400
CGTGTAGAGC GCGGCATCAT CAAGGTCCAG GAAGAAGTGG AAATCGTCGG 450
CATCAAGGCG ACCACCAAGA CTACCTGCAC CGGCGTTGAA ATGTTCCGCA 500
AGCTGCTCGA CGAAGGTCGT GCTGGTGAGA ACGTTGGTAT CCTGCTGCGT 550
GGCACCAAGC GTGAAGACGT AGAGCGTGGC CAGGTTCTGG CCAAGCCGGG 600
CACCATCAAG CCGCACACCA AGTTCGAGTG CGAAGTGTAC GTGCTGTCCA 650
AGGAAGAAGG TGGTCGTCAC ACCCCGTTCT TCAAGGGCTA CCGTCCGCAG 700
TTCTACTTCC GTACCACCGA YGTGACCGGT AACTGCGAAC TGCCGGAAGG 750
CGTAGAGATG GTAATGCCGG GCGACAACAT CAAGATGGTT GTCACCCTGA 800
TCGCTCCGAT CGCCATGGAA GATGGCCTGC G 831



2) INFORMATION FOR SEQ ID NO: 881 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R421 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 881 

CCTATGCCTC AAACACGTGA ACACATCTTG TTATCACGTA ACGTTGGTGT 50 

ACCATACATC GTTGTTTTCT TAAACAAAAT GGATATGGTT GATGACGAAG 100 

AATTACTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT GTCAGAATAT 150 

GACTTCCCAG GCGACGATGT TCCTGTAATC GCTGGTTCTG CTTTGAAAGC 200 

TCTTGAAGGC GATGCTTCAT ACGAAGAAAA AATCATGGAA TTAATGGCTG 250 

CAGTTGACGA ATACGTTCCA ACTCCAGAAC GTGACACTGA CAAACCATTC 300 

ATGATGCCAG TCGAAGACGT ATTCTCAATC ACTGGACGTG GTACTGTTGC 350 

TACAGGCCGT GTTGAACGTG GACAAGTTCG CGTTGGTGAC GAAGTTGAAA 400 

TCGTTGGTAT TGCTGAAGAA ACTGCTAAAA CAACTGTAAC TGGTGTTGAA 450 

ATGTTCCGTA AATTGTTAGA CTATGCTGAA GCAGGGGATA ACATTGGTGC 500 

ATTGCTACGT GGTGTTGCTC GTGAAGACAT CCAACGTGGA CAAGTATTGG 550 

CTAAAGCTGG TACAATCACA CCTCATACAA AATTTAAAGC TGAAGTTTAC 600 

GTTTTAACAA AAGAAGAAGG TGGACGTCAC ACACCATTCT TC 642 



2) INFORMATION FOR SEQ ID NO: 882 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R775 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 882

CCTATGCCTC AAACACGTGA ACACATCTTG TTATCACGTA ACGTTGGTGT 50
ACCATACATC GTTGTTTTCT TAAACAAAAT GGATATGGTT GATGACGAAG 100
AATTACTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT GTCAGAATAT 150
GACTTCCCAG GCGACGATGT TCCTGTAATC GCTGGTTCTG CTTTGAAAGC 200
TCTTGAAGGC GATGCTTCAT ACGAAGAAAA AATCATGGAA TTAATGGCTG 250
CAGTTGACGA ATACGTTCCA ACTCCAGAAC GTGACACTGA CAAACCATTC 300
ATGATGCCAG TCGAAGACGT ATTCTCAATC ACTGGACGTG GTACTGTTGC 350
TACAGGCCGT GTTGAACGTG GACAAGTTCG CGTTGGTGAC GAAGTTGAAA 400
TCGTTGGTAT TGCTGAAGAA ACTGCTAAAA CAACTGTAAC TGGTGTTGAA 450
ATGTTCCGTA AATTGTTAGA CTATGCTGAA GCAGGGGATA ACATTGGTGC 500
ATTGCTACGT GGTGTTGCTC GTGAAGACAT CCAACGTGGA CAAGTATTGG 550
CTAAAGCTGG TACAATCACA CCTCATACAA AATTTAAAGC TGAAGTTTAC 600
GTTTTAACAA AAGAAGAAGG TGGACGTCAT ACACCA 636



2) INFORMATION FOR SEQ ID NO: 883 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R422 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 883 

GGTCCTATGC CTCAAACACG TGAACATATC TTATTATCAC GTAACGTTGG 50 

TGTACCATAC ATCGTTGTAT TCTTAAACAA AATGGATATG GTTGATGACG 100 

AAGAATTATT AGAATTAGTA GAAATGGAAG TTCGTGACTT ATTATCAGAA 150 

TACGATTTCC CAGGCGATGA TGTTCCAGTT ATCGCAGGTT CTGCTTTGAA 200

AGCTTTAGAA GGCGACGAGT CTTATGAAGA AAAAATCTTA GAATTAATGG 250

CTGCAGTTGA CGAATATATC CCAACTCCAG AACGTGATAC TGACAAACCA 300

TTCATGATGC CAGTCGAAGA CGTATTCTCA ATCACTGGAC GTGGTACTGT 350

TGCTACAGGC CGTGTTGAAC GTGGTGAAGT TCGCGTTGGT GACGAAGTTG 400

AAATCGTTGG TATTAAAGAC GAAACATCTA AAACAACTGT TACAGGTGTT 450

GAAATGTTCC GTAAATTATT AGACTACGCT GAAGCAGGCG ACAACATCGG 500 

TGCTTTATTA CGTGGTGTAG CACGTGAAGA TATCGAACGT GGACAAGTAT 550 

TAGCTAAACC AGCTACAATC ACTCCACACA CAAAATTCAA AGCTGAAGTA 600 

TACGTATTAT CAAAAGAAGA AGGCGGACGT CACACTCCAT T 641 



2) INFORMATION FOR SEQ ID NO: 884 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis

(B) STRAIN: R575 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 884

TATGCCTCAA ACACGTGAAC ATATCTTATT ATCACGTAAC GTTGGTGTAC 50 

CATACATCGT TGTATTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA 100 

TTATTAGAAT TAGTAGAAAT GGAAGTTCGT GACTTATTAT CAGAATACGA 150 

TTTCCCAGGC GATGATGTTC CAGTTATCGC AGGTTCTGCT TTGAAAGCTT 200

TAGAAGGCGA CGAGTCTTAT GAAGAAAAAA TCTTAGAATT AATGGCTGCA 250 

GTTGACGAAT ATATCCCAAC TCCAGAACGT GATACTGACA AACCATTCAT 300 

GATGCCAGTC GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA 350 

CAGGCCGTGT TGAACGTGGT GAAGTTCGCG TTGGTGACGA AGTTGAAATC 400

GTTGGTATTA AAGACGAAAC ATCTAAAACA ACTGTTACAG GTGTTGAAAT 450

GTTCCGTAAA TTATTAGACT ACGCTGAAGC AGGCGACAAC ATCGGTGCTT 500 

TATTACGTGG TGTTGCACGT GAAGATATCG AACGTGGACA AGTATTAGCT 550 

AAACCAGCTA CAATCACTCC ACACACAAAA TTCAAAGCTG AAGTATACGT 600 

ATTATCAAAA GAAGAAGGCG GACGTCACAC TCCATTCTTC 640



2) INFORMATION FOR SEQ ID NO: 885 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 632 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE:

(A) ORGANISM: Enterococcus faecium

(B) STRAIN: R492

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 885 

TGCCTCAAAC TCGTGAACAC ATCCTATTGT CTCGTCAAGT TGGTGTTCCT 50 

TACATCGTTG TATTCTTGAA CAAAGTAGAC ATGGTTGATG ACGAAGAATT 100

ACTAGAATTA GTTGAAATGG AAGTTCGTGA CCTATTAACA GAATACGAAT 150 

TCCCTGGTGA CGATGTTCCT GTAGTTGCTG GATCAGCTTT GAAAGCTCTA 200 

GAAGGCGACG CTTCATACGA AGAAAAAATT CTTGAATTAA TGGCTGCAGT 250

TGACGAATAC ATCCCAACTC CAGAACGTGA CAACGACAAA CCATTCATGA 300

TGCCAGTTGA AGACGTGTTC TCAATTACTG GACGTGGTAC TGTTGCTACA 350

GGTCGTGTTG AACGTGGACA AGTTCGCGTT GGTGACGAAG TTGAAGTTGT 400

TGGTATTGCT GAAGAAACTT CAAAAACAAC AGTTACTGGT GTTGAAATGT 450

TCCGTAAATT GTTAGACTAC GCTGAAGCTG GAGACAACAT TGGTGCTTTA 500

CTACGTGGTG TTGCACGTGA AGACATCCAA CGTGGACAAG TTTTAGCTAA 550

ACCAGGTACA ATCACACCTC ATACAAAATT CTCTGCAGAA GTATACGTGT 600

TGACAAAAGA AGAAGGTGGA CGTCATACTC CA 632 



2) INFORMATION FOR SEQ ID NO: 886


WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium

(B) STRAIN: R576 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 886 

CGGCCCAATG CCTCAAACTC GTGAACACAT CCTATTGTCT CGTCAAGTTG 50 

GTGTTCCTTA CATCGTTGTA TTCTTGAACA AAGTAGACAT GGTTGATGAC 100 

GAAGAATTAC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTAACAGA 150 

ATACGAATTC CCTGGTGACG ATGTTCCTGT AGTTGCTGGA TCAGCTTTGA 200 

AAGCTCTAGA AGGCGACGCT TCATACGAAG AAAAAATTCT TGAATTAATG 250 

GCTGCAGTTG ACGAATACAT CCCAACTCCA GAACGTGACA ACGACAAACC 300 

ATTCATGATG CCAGTTGAAG ACGTGTTCTC AATTACTGGA CGTGGTACTG 350 

TTGCTACAGG TCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGACGAAGTT 400 

GAAGTTGTTG GTATTGCTGA AGAAACTTCA AAAACAACAG TTACTGGTGT 450 

TGAAATGTTC CGTAAATTGT TAGACTACGC TGAAGCTGGA GACAACATTG 500 

GTGCTTTACT ACGTGGTGTT GCACGTGAAG ACATCCAACG TGGACAAGTT 550 

TTAGCTAAAC CAGGTACAAT CACACCTCAT ACAAAATTCT CTGCAGAAGT 600 

ATACGTGTTG ACAAAAGAAG AAGGTGGACG TCATACTCCA 640 



2) INFORMATION FOR SEQ ID NO: 887 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 887 

ATCCTGGTTT GCTCCGCAGC TGACGGCCCA ATGCCACAGA CCCGCGAGCA 50 

CATCCTGCTG GCCCGCCAAG TTGGCGTTCC ATACATCATC GTGTTCCTGA 100 

ACAAGTGCGA CCTGGTTGAC GACGCAGAAC TGCTGGAACT GGTCGAAATG 150 

GAAGTGCGTG AATTGCTGTC GAAATACGAG TTCCCAGGCG ACGACGTACC 200 

AATCATCAAG GGTTCGGCAC GTATGGCGCT GGAAGGCAAA GAAGGCGAGA 250 

TGGGCGTTGA CGCCATCATG CGTCTGGCCG ATGCACTGGA CAGCTACATC 300 

CCTACGCCAG AGCGCGCAGT CGATGGCGCC TTCCTGATGC CAGTGGAAGA 350 

CGTGTTCTCG ATCTCGGGTC GCGGTACCGT TGTGACCGGT CGTATCGAGC 400

GCGGCGTGAT CAAGGTCGGC GAAGAGATCG AAATCGTCGG CATTATCGAC 450 

ACCGTCAAAA CCACTTGCAC CGGCGTGGAA ATGTTCCGCA AGCTGCTGGA 500 

CCAGGGTCAA GCCGGCGACA ACGTTGGTCT GCTGCTGCGC GGCACCAAGC 550 

GTGAAGACGT ACAGCGTGGT CAGGTTCTGG CCAAGCCAGC GTCGATCAAG 600 

CCGCACAACC ACTTCACCGG CGAGATCTAC GTTCTGTCGA AAGATGAAGG 650

CGGCCGTCAC ACCCCGTTCT TCAACAACTA TCGTCCACAG TTCTACTTCC 700

GTACGACTGA CGTGACCGGT TCGATCGAAC TGCCAGCAGA CAAAGAAATG 750 

GTCATGCCAG GCGACAACGT GTCGATCACC GTCAAGCTGA TCAACCCGAT 800 

CGCGAT 806 




2) INFORMATION FOR SEQ ID NO: 888 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis

(B) STRAIN: R503

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 888

TATGCCTCAA ACACGTGAAC ATATCTTATT ATCACGTAAC GTTGGTGTAC 50
CATACATCGT TGTATTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA 100
TTATTAGAAT TAGTAGAAAT GGAAGTTCGT GACTTATTAT CAGAATACGA 150
TTTCCCAGGC GATGATGTTC CAGTTATCGC AGGTTCTGCT TTGAAAGCTT 200
TAGAAGGCGA CGAGTCTTAT GAAGAAAAAA TCTTAGAATT AATGGCTGCA 250
GTTGACGAAT ATATCCCAAC TCCAGAACGT GATACTGACA AACCATTCAT 300
GATGCCAGTC GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA 350
CAGGCCGTGT TGAACGTGGT GAAGTTCGCG TTGGTGACGA AGTTGAAATC 400
GTTGGTATTA AAGACGAAAC ATCTAAAACA ACTGTTACAG GTGTTGAAAT 450
GTTCCGTAAA TTATTAGACT ACGCTGAAGC AGGCGACAAC ATCGGTGCTT 500
TATTACGTGG TGTAGCACGT GAAGATATCG AACGTGGACA AGTATTAGCT 550
AAACCAGCTA CAATCACTCC ACACACAAAA TTCAAAGCTG AAGTATACGT 600
ATTATCAAAA GAAGAAGGCG GACGTCACAC TCCA 634



2) INFORMATION FOR SEQ ID NO: 889

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 493 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 14110 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 889 

TGTCTTCATC CAGGAATTGA TTGTGAGTCG TTCCACATGC TCACCTAGTT 50 

TTCGCTCGAT CTTTTCACTA ACGCAAACCA TGTAGAACAA CATTGCCAAG 100 

GCCCACGGTG GTTACTCCGT CTTCACTGGT GTTGGTGAGC GTACTCGTGA 150 

GGGTAACGAT CTGTACCACG AAATGCAGGA GACTGGTGTC ATTCAGCTCG 200 

AGGGTGAATC CAAGGTCGCA CTGGTGTTCG GACAGATGAA CGAGCCCCCC 250

GGTGCCCGTG CCCGTGTCGC CCTTACCGGT CTGACCATTG CCGAGTACTT 300

CCGTGACGAG GAGGGTCAGG ACGTGCTGCT CTTCATTGAC AACATTTTCC 350

GTTTCACCCA GGCCGGTTCT GAGGTGTCTG CCCTTCTCGG TCGTATCCCC 400

TCTGCCGTCG GTTACCAGCC CACCCTGGCC GTCGACATGG GTGGTATGCA 450

GGAGCGTATC ACCACCACCA AGAAGGGTTC TATTACCTCC GTC 493



2) INFORMATION FOR SEQ ID NO: 890 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 466 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 64101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 890 

TTGATTGTAC GTCTTTACCT TTCTGCCTGA CTGTTTACGA CAACTAACGA 50 

AAGCGTAGAA CAACATTGCC AAGGCTCACG GTGGTTACTC TGTCTTCACT 100 

GGTGTCGGTG AACGTACTCG TGAGGGTAAC GATTTGTACC ACGAAATGCA 150 

GGAAACTGGT GTCATTCAGC TCGAGGGTGA ATCCAAGGTC GCCCTCGTGT 200

TCGGTCAGAT GAACGAGCCC CCCGGTGCCC GTGCCCGTGT CGCTCTTACT 250 

GGTTTGACCA TTGCCGAGTA CTTCCGTGAC GAGGAAGGTC AGGACGTGCT 300 

TCTCTTCATT GACAACATTT TCCGTTTCAC TCAGGCCGGT TCTGAGGTGT 350 

CTGCCCTTCT GGGTCGTATC CCCTCTGCCG TCGGTTACCA GCCCACCCTT 400

GCCGTCGACA TGGGTATCAT GCAGGAGCGT ATTACCACCA CCACCAAGGG 450

TTCCATCACC TCCGTC 466



2) INFORMATION FOR SEQ ID NO: 891 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 bases

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 

(B) STRAIN: ATCC 42570 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 891 

AGGAGCTGAT CAACAACATC GCCAAGGCTC ACGGTGGTTA CTCCGTCTTC 50 

ACTGGTGTCG GTGAGCGTAC CCGTGAGGGT AACGATCTGT ACCACGAAAT 100 

GCAGGAGACC TCGGTCATTC AGCTCGAGGG CGAGTCTAAG GTGGCCCTGG 150 

TCTTTGGTCA GATGAACGAG CCCCCGGGTG CTCGTGCCCG TGTCGCTCTT 200 

ACTGGTCTTA CCGTCGCCGA GTACTTCCGT GACCAGGAGG GTCAGGATGG 250 

TTAGTTCTCG TCCACTCATG CCGAAACATG TGCGTGTTCC GAGGCTAATC 300 

AACGTGCCAG TGCTGCTTTT CATCGACAAC ATTTTCCGAT TCACACAGGC 350